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PRODUCTION OF LYSOSOMAL ENZYMES IN 
PLANT-BASED EXPRESSION SYSTEMS 

This application is a continuation-in-part of provisional 
application serial number 60/003,737, filed September 14, 
1995, the disclosure of which is incorporated herein in its 
5 entirety. 

This invention was made with United States government support 
under grant nos. NS32369 and DK48570 awarded by the National 
Institutes of Health. The government has certain rights in 
the invention. 

1. FIELD OF THE INVENTION 

10 The present invention relates to the production of human 

and animal lysosomal enzymes in plants comprising expressing 
the genetic coding sequence of a human or animal lysosomal 
enzyme in a plant expression system. The plant expression 
system provides for post-translational modification and 

15 processing to produce recombinant protein having enzymatic 
activity. 

The invention is demonstrated herein by working examples 
in which transgenic tobacco plants produce a modified human 
glucocerebrosidase (hGC) and a human a-L-iduronidase (IDUA) , 
20 both of which are enzymatically active. 

The recombinant lysosomal enzymes produced in accordance 
with the invention may be used for a variety of purposes 
including but not limited to enzyme replacement therapy for 
the therapeutic treatment of lysosomal storage diseases, 
research for development of new approaches to medical 
treatment of lysosomal storage diseases, and industrial 
processes involving enzymatic substrate hydrolysis. 



25 



30 



35 



2. BACKGROUND OF THE INVENTION 

2.1. LYSOSOMAL STORAGE DISEASES 

Lysosomes, which are present in all animal cells, are 
acidic cytoplasmic organelles that contain an assortment of 
hydrolytic enzymes. These enzymes function in the 
degradation of internalized and endogenous macromolecular 
substrates. When there is a lysosomal enzyme deficiency, the 
deficient enzyme's undegraded substrates gradually accumulate 
within the lysosomes causing a progressive increase in the 
size and number of these organelles within the cell. This 
accumulation within the cell eventually leads to malfunction 



WO 97/1 0353 PCT/US96/1 4730 

of the organ and to the gross pathology of a lysosomal 
storage disease, with the particular disease depending on the 
particular enzyme deficiency. More than thirty distinct, 
inherited lysosomal storage diseases have been characterized 
5 in humans. 

A few examples of lysosomal storage diseases (and their 
associated deficient enzymes) include Fabry disease 
(a-galactosidase) , Farber disease (ceramidase) , Gaucher 
disease (glucocerebrosidase) , G ml gangliosidosis 

10 (B-galactosidase) , Tay-Sachs disease (B -hexosaminidase) , 
Niemann-Pick disease (sphingomyelinase) , Schindler disease 
(or-N-acetylgalactosaminidase) , Hunter syndrome (iduronate- 
2-sulf atase) , Sly syndrome (B-glucuronidase) , Hurler and 
Hurler/Scheie syndromes (iduronidase) , and I-Cell/San Filipo 

15 syndrome (mannose 6-phosphate transporter) . 

One proven treatment for lysosomal storage diseases is 
enzyme replacement therapy in which an active form of the 
enzyme is administered directly to the patient. However, 
abundant, inexpensive and safe supplies of therapeutic 

20 lysosomal enzymes are not commercially available for the 
treatment of any of the lysosomal storage diseases. 

2.1.1. GAUCHER DISEASE AND TREATMENT 

Gaucher disease is the most common lysosomal storage 
25 disease in humans, with the highest frequency encountered in 
the Ashkenazi Jewish population. About 5,000 to 10,000 
people in the United States are afflicted with this disease 
(Grabowski, 1993, Adv. Hum. Genet. 21:377-441). Gaucher 
disease results from a deficiency in glucocerebrosidase (hGC; 
30 glucosylceramidase; acid 0-glucosidase; EC 3.2.1.45). This 
deficiency leads to an accumulation of the enzyme's 
substrate, glucocerebroside, in reticuloendothelial cells of 
the bone marrow, spleen and liver, resulting in significant 
skeletal complications such as bone marrow expansion and bone 
35 deterioration, and also hypersplenism, hepatomegaly, 

thrombocytopenia, anemia and lung complications (Grabowski, 
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1993, supra; Lee, 1982, Prog. Clin. Biol. Res. 95:177-217; 
Brady et ai . , 1965, Biochem. Biophys. Res. Comm. 18:221-225). 

hGC replacement therapy has revolutionized the medical 
care and management of Gaucher disease, leading to 
5 significant improvement in the quality of life of many 
Gaucher patients (Pastores et al . , 1993, Blood 82:408-416; 
Fallet et al . , 1992, Pediatr . Res. 31:496-502). Studies have 
shown that regular, intravenous administration of 
specifically modified hGC (Ceredase™, Genzyme Corp.) can 

i0 result in dramatic improvements and even reversals in the 
hepatic, splenic and hematologic manifestations of the 
disease (Pastores et al . , 1993, supra; Fallet et al . , 1992, 
supra; Figueroa et al . , 1992, N. Eng. J. Med. 327:1632-1636; 
Barton et al . , 1991, N. Eng. J. Med. 324:1464-1470; Beutler 

15 et al., 1991, Blood 78:1183-1189). Improvements in 

associated skeletal and lung complications are possible, but 
require larger doses of enzyme over longer periods of time. 

Despite the benefits of hGC replacement therapy, the 
source and high cost of the enzyme seriously restricts its 

20 availability. Until recently, the only commercial source of 
purified hGC has been from pooled human placentae, where ten 
to twenty kilograms (kg) of placentae yield only 1 milligram 
(mg) of enzyme. From five hundred to two thousand kilograms 
of placenta (equivalent to 2,000-8,000 placentae) are 

25 required to treat each patient every two weeks. Current 
costs for HGC replacement therapy range from $55 to $220/kg 
patient body weight every two weeks, or from $70,000 to 
$300,000/year for a 50 kg patient. Since the need for 
therapy essentially lasts for the duration of a patient's 

30 life, costs for the enzyme alone may exceed $15,000,000 
during 3 0 to 7 0 years of therapy. 

A second major problem associated with treating Gaucher 
patients with glucocerebrosidase isolated from human tissue 
(and perhaps even from other animal tissues) is the risk of 

35 exposing patients to infectious agents which may be present 
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in the pooled placentae, e.g., human immuno-def iciency virus 
(HIV) , hepatitis viruses, and others. 

Accordingly, a new source of hGC is needed to 
effectively reduce the cost of treatment and to eliminate the 
5 risk of exposing Gaucher patients to infectious agents. 

2.1*2. HURLER SYNDROME AND TREATMENT 

Hurler syndrome is the most common of the group of human 
lysosomal storage disorders known as the mucopolysac- 

10 charidoses (MPS) involving an inability to degrade dermatan 
sulfate and heparan sulfate. Hurler patients are deficient 
in the lysosomal enzyme, cr-L-iduronidase (IDUA) , and the 
resulting accumulation of glucosaminoglycans in the lysosomes 
of affected cells leads to a variety of clinical 

15 manifestations (Neufeld & Ashwell, 1980, The Biochemistry of 
Glycoproteins and Proteoglycans , ed. W.J, Lennarz, Plenum 
Press, NY; pp. 241-266) including developmental delay, 
enlargement of the liver and spleen, skeletal abnormalities, 
mental retardation, coarsened facial features, corneal 

2 0 clouding, and respiratory and cardiovascular involvement. 
Hurler/Scheie syndrome (MPS I H/S) and Scheie syndrome (MPS 
IS) represent less severe forms of the disorder but also 
involve deficiencies in IDUA. Molecular studies on the genes 
and cDNAs of MPS I patients has led to an emerging 

25 understanding of genotype and clinical phenotype (Scott et 

aJ., 1990, Am. J. Hum. Genet. 47:802-807). In addition, both 
a canine and feline form of MPS I have been characterized 
(Haskins et al . , 1979, Pediat. Res. 13:1294-1297; Haskins and 
Kakkis, 1995, Am. J. Hum. Genet. 57:A39 Abstr. 194; Shull et 

30 al., 1994, Proc. Natl. Acad. Sci . USA, 91:12937-12941) 

providing an effective In vivo model for testing therapeutic 
approaches. 

The efficacy of enzyme replacement in the canine model 
of Hurler syndrome using human IDUA generated in CHO cells 
35 was recently reported (Kakkis et al., 1995, Am. J. Hum. 

Genet. 57:A39 (Abstr.); Shull et al . , 1994, supra). Weekly 
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doses of approximately 1 mg administered over a period of 3 
months resulted in normal levels of the enzyme in liver and 
spleen, lower but significant levels in kidney and lungs and 
very low levels in brain, heart, cartilage and cornea (Shull 
5 et al . , 1994, supra. Tissue examinations showed 

normalization of lysosomal storage in the liver, spleen and 
kidney, but no improvement in heart, brain and corneal 
tissues. One dog was maintained on treatment for 13 months 
and was clearly more active with improvement in skeletal 

10 deformities, joint stiffness, corneal clouding and weight 
gain (Kakkis et al . , 1995, supra. A single higher-dose 
experiment was quite promising and showed detectable IDUA 
activity in the brain and cartilage in addition to tissues 
which previously showed activity at the lower does. 

15 Additional higher-dose experiments and trials involving 

longer administration are currently limited by availability 
of recombinant enzyme. These experiments underscore the 
potential of replacement therapy for Hurler patients and the 
severe constraints on both canine and human trials due to 

20 limitations in recombinant enzyme production using current 
technologies . 

2.2. BIOSYNTHESIS OF LYSOSOMAL ENZYMES 

Soluble lysosomal enzymes share initial steps of 
25 biosynthesis with secretory proteins, i.e., synthesis on the 
ribosome, binding of the N-terminal signal peptide to the 
surface of the rough endoplasmic reticulum (ER) , transport 
into the lumen of the ER where the signal peptide is cleaved, 
and addition of oligosaccharides to specific asparagine 
30 residues (N-linked) , followed by further modifications of the 
nascent protein in the Golgi apparatus (von Figura and 
Hasilik, 1986, Annu. Rev. Biochem. 55:167-193). The N-linked 
oligosaccharides can be complex, diverse and heterogeneous, 
and may contain high-mannose residues. The proteins undergo 
35 further processing in a post-ER, pre-Golgi compartment and in 
the cis-Golgi to form either an N-linked mannose 6-phosphate 
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(M-6-P) oligosaccharide-dependent or N-linked M-6-P 
oligosaccharide- independent recognition signal for lysosomal 
localized enzymes (Kornfeld & Mellman, 1989, Ann. Rev. Cell 
Biol., 5:483-525; Kaplan et al . , 1977, Proc. Natl. Acad. Sci. 
5 USA 74:2026). The presence of the M-6-P recognition signal 
results in the binding of the enzyme to M-6-P receptors 
(MPR) . These bound enzymes remain in the cell, are 
eventually packaged into lysosomes, and are thus segregated 
from proteins targeted for secretion or to the plasma 
10 membrane. 

Although many lysosomal enzymes are soluble and are 
transported to lysosomes by MPRs, integral membrane and 
membrane-associated proteins (notably hGC) are targeted and 
transported to lysosomes independent of the M-6-P/MPR system 

15 (Kornfeld & Mellman, 1989, Erickson et al . , 1985). hGC does 
not become soluble after translation, but instead becomes 
associated with the lysosomal membrane by means which have 
not been elucidated (von Figura & Hasilik, 1986, Annu. Rev. 
Biochem. 55:167-193; Kornfeld and Mellman, 1989, Annu. Rev. 

20 Cell Biol. 5:483-525). 

hGC is synthesized as a single polypeptide (58 kDa) with 
a signal sequence (2 kDa) at the amino terminus. The signal 
sequence is co-translationally cleaved and the enzyme is 
glycosylated with a heterogeneous group of both complex and 

25 high-mannose oligosaccharides to form a precursor. The 

glycans are predominately involved in protein conformation. 
The "high mannose" precursor, which has a molecular weight of 
63 Kda, is post-translationally processed in the Golgi to a 
66 Kda intermediate, which is then further modified in the 

30 lysosome to the mature enzyme having a molecular weight of 59 
Kda (Jonsson et al . , 1987, Eur. J. Biochem. 164:171; Erickson 
et aJ., 1985, J. Biol. Chem. , 260:14319). 

The mature hGC polypeptide is composed of 497 amino 
acids and contains five N-glycosylation amino acid consensus 

35 sequences (Asn-X-Ser/Thr) . Four of these sites are normally 
glycosylated. Glycosylation of the first site is essential 
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for the production of active protein. Both high-mannose and 
complex oligosaccharide chains have been identified 
(Berg-Fussman et al - , 1993 , J. Biol. Chem. 268:14861-14866). 
hGC from placenta contains 7% carbohydrate, 20% of which is 
5 of the high-mannose type (Grace & Grabowski, 1990, Biochem. 
Biophys. Res. Comm. 168:771-777). Treatment of placental hGC 
with neuraminidase (yielding an asialo enzyme) results in 
increased clearance and uptake rates by rat liver cells with 
a concomitant increase in hepatic enzymatic activity (Furbish 

10 et al . , 1981, Biochim. Biophys. Acta 673:425-434). This 
glycan-modif ied placental hGC is currently used as a 
therapeutic agent in the treatment of Gaucher 's disease. 
Biochemical and site-directed mutagenesis studies have 
provided an initial map of regions and residues important to 

15 folding, activator interaction, and active site location 
(Grace et al . , 1994, J. Biol. Chem. 269:2283-2291). 

The complete complementary DNA (cDNA) sequence for hGC 
has been published (Tsuji et al . , 1986, J. Biol. Chem. 
261:50-53; Sorge et al . , 1985, Proc. Natl. Acad. Sci. USA 

20 82:7289-7293), and E. coli containing the hGC cDNA sequence 
cloned from fibroblast cells, as described (Sorge et al . , 
1985, supra), is available from the American Type Culture 
Collection (ATCC) (Accession No. 65696) . 

Recombinant methodologies have the potential to provide 

25 a safer and less expensive source of lysosomal enzymes for 
replacement therapy. However, production of active enzymes, 
e.g., hGC, in a heterologous system requires correct 
targeting to the ER, and appropriate N-l inked glycosylation 
at levels or efficiencies that avoid ER-based degradation or 

3 0 aggregation. Since mature lysosomal enzymes must be 

glycosylated to be active, bacterial systems cannot be used. 
For example, hGC expressed in E. coll is enzymatically 
inactive (Grace & Grabowski, 1990, supra) . 

Active monomers of hGC have been purified from insect 

35 cells (Sf9 cells) and Chinese hamster ovary (CHO) cells 

infected or transfected, respectively, with hGC cDNA (Grace & 
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Grabowski, 1990, supra; Grabowski et al . , 1989, Enzyme 
41:131-142). A method for producing recombinant hGC in CHO 
cell cultures and in insect cell cultures was recently 
disclosed in U.S. Patent No. 5,236,838. Recombinant hGC 
5 produced in these heterologous systems had an apparent 

molecular weight ranging from 64 to 73 kDa and contained from 
5 to 15% carbohydrate (Grace & Grabowski, 1990, supra; Grace 
et al., 1990, J. Biol. Chem. 265:6827-6835). These 
recombinant hGCs had kinetic properties identical to the 

10 natural enzyme isolated from human placentae, as based on 
analyses using a series of substrate and transition state 
analogues, negatively-charged lipid activators, protein 
activators (saposin C) , and mechanism-based covalent 
inhibitors (Grace et al . , 1994, supra; Berg-Fussman et al . , 

15 1993, supra; Grace et al . , 1990, J. Biol. Chem. 

265:6827-6835; Grabowski et al . , 1989, supra). However, both 
insect cells and CHO cells retained most of the enzyme rather 
than secreting it into the medium, significantly increasing 
the difficulty and cost of harvesting the pure enzyme 

20 (Grabowski et al . , 1989, supra). 

Accordingly, a recombinant system is needed that can 
produce human or animal lysosomal enzymes in an active form 
at lower cost, and that will be appropriately targeted for 
ease of recovery. 

25 

2.3. MAMMALIAN LY8Q80MES VBR8UB PLANT VACUOLES 

Because plants are eukaryotes, plant expression systems 
have advantages over prokaryotic expression systems, 
particularly with respect to correct processing of eukaryotic 

30 gene products. However, unlike animal cells, plant cells do 
not possess lysosomes. Although the plant vacuole appears 
functionally analogous to the lysosome, plants do not contain 
MPRs (Chrispeels, 1991, Ann. Rev. PI. Phys. PI. Mol. Biol. 
42:21-53; Chrispeels and Tague, 1991, Intl. Rev. Cytol. 

35 125:1-45), and the mechanisms of vacuolar targeting can 

differ significantly from those of lysosomal targeting. For 
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example, the predominant mechanism of vacuolar targeting in 
plants does not appear to be glycan-dependent , but appears to 
be based instead on C- or N-terminal peptide sequences (Gomez 
& Chrispeels, 1993, Plant Cell 5:1113-1124; Chrispeels & 
5 Raikhal, 1992, Cell 68:613-618; Holwerda et al . , 1992, Plant 
Cell 4:307-318; Neuhaus et al . , 1991, Proc. Natl. Acad. Sci. 
USA 88:10362-10366; Chrispeels, 1991, supra; Chrispeels & 
Tague, 1991, supra; Holwerda et al . , 1990, Plant Cell 
2:1091-1106; Voelker et al . , 1989, Plant Cell 1:95-104). As 
10 a result, plants have not been viewed as appropriate 
expression systems for lysosomal enzymes which must be 
appropriately processed to produce an active product. 



3 . SUMMARY OF THE INVENTION 

15 The present invention relates to the production of human 

or animal lysosomal enzymes in transformed or transfected 
plants, plant cells or plant tissues, and involves 
constructing and expressing recombinant expression constructs 
comprising lysosomal enzyme coding sequences in a plant 

20 expression system. The plant expression system provides 
appropriate co-translational and post-translational 
modifications of the nascent peptide required for processing, 
e.g., signal sequence cleavage, glycosy lation, and sorting of 
the expression product so that an enzymatically active 

2 5 protein is produced. Using the methods described herein, 

recombinant lysosomal enzymes are produced in plant 
expression systems from which the recombinant lysosomal 
enzymes can be isolated and used for a variety of purposes. 
The present invention is exemplified by the genetic- 

3 0 engineering of transgenic tobacco plants with three lysosomal 

enzyme expression constructs. One construct comprises a 
nucleotide sequence encoding a modified human 

glucocerebrosidase (hGC) , specifically a hGC fused at its C- 
terminal to the eight amino acid FLAG™ peptide (hGC: FLAG™) . 
35 Another construct comprises nucleotide sequence encoding a 
human a-L-iduronidase (IDUA) . The third construct comprises 
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a nucleotide sequence encoding a human glucocerebrosidase 
(hGC) - Transgenic tobacco plants having the expression 
constructs produce lysosomal enzymes that are enzymatically 
active. 

5 The plant expression systems and the recombinant 

lysosomal enzymes produced therewith have a variety of uses, 
including but not limited to: (1) the production of 
enzymatically active lysosomal enzymes for the treatment of 
lysosomal storage diseases; (2) the production of altered or 

10 mutated proteins, enzymatically active or otherwise, to serve 
as precursors or substrates for further in vivo or In vitro 
processing to a specialized industrial form for research or 
therapeutic uses, such as to produce a more effective 
therapeutic enzyme; (3) the production of antibodies against 

15 lysosomal enzymes for medical diagnostic use; and (4) use in 
any commercial process that involves substrate hydrolysis. 

4. BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1. hGC: FLAG" 1 cDNA plant expression construct and 
20 transformation vector. The MeGA : hGC : FLAG™ construct in a pBS 
intermediate vector is excised and inserted into the SstI 
site of the binary plant transformation vector pBIB-KAN to 
form plasmid CTProl : hGC: FLAG. R and L represent T-DNA right 
and left borders, respectively, which precisely delineate the 
25 DNA inserted into the plant genome. NPTII = kanamycin 
selectable marker, FL = FLAG™ epitope, pAnos = 
polyadenylation/ terminator signal, Pnos = promoter seguence 
from Agrobacterium tumefaciens nopaline synthetase gene. 
PCR-amplif ication primers for hGC were: GC1 
30 ( 5 ' TTGtcTAGaGTAAGCATCATGGCTGGC3 ' ) ( SEQ ID NO : 1 ) ; and GC4 
( 5 ' cacaaattCTGGCGACGCCACAGGTAGGTGTGA3 ' ) (SEQ ID NO : 2 ) ; 
hGC-derived sequences are in upper case; restriction sites 
are underlined. Restriction enzymes: E, EcoRI ; S, SstI ; N, 
NotI; X, Xbal. 

35 FIGS. 2A-E. Transformation and generation of tobacco 

plants carrying the MeGA: hGC: FLAG™ construct. FIG. 2A. 
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Agrobacterium-mediated transformation of tobacco leaf discs. 
Leaf discs were inoculated with a cell suspension of 
A. tumefaciens strains carrying the plasmid CTProl : hGC : FLAG - 
FIG. 2B. Development of shoots on selection media 22 days 
5 post-inoculation. FIG . 2C. Development of roots on rooting 
media 27 days post-inoculation. Use of rooting media 
containing kanamycin clearly differentiated between 
transgenic shoots which formed roots and "false positive" 
shoots which did not form roots on selective media. FIG. 2D. 
10 Transformed plants three weeks after transfer to soil. FIG. 
2E. Transformed plant 10 weeks after transfer to soil. 

FIG. 3. Genomic Southern hybridization analysis of 
control and transgenic plants. Total genomic DNA was 
isolated from an untransf ormed control plant (UT) and 
15 independent transf ormants generated from Nlcotlana tabacum 

cv. Xanthl (X-l, X-8 r X-9, X-ll) and cv. VA116 (VI). Five to 
10 /ig of total genomic DNA were digested with Hindi I I and 
resolved on a TBE agarose gel. The DNA was blotted to 
nitrocellulose membrane and probed with a 32 P-labeled 
20 hGC: FLAG™ sequence from a gel-purified 1.7 kb Hindlll 

fragment isolated from the pBS intermediate vector containing 
the MeGA: hGC: FLAG™ expression construct (see FIG. 1). 

FIG. 4. Induction of hGC : FLAG™ mRNA levels in 
transgenic plants. Total RNA was isolated by standard 
25 guanidino-thiocyanate methods from UT and X-ll leaf tissue at 
0 and 24 hr post-mechanical gene activation (MGA) . Five jig 
of total RNA was glyoxylated, size-separated on a 1.2% 
agarose gel, transferred to NitroPure (MSI) filters and 
probed with a 32 P- labeled hGC: FLAG™ gene sequence from a 
30 gel-purified 1.7 kb Hindlll fragment isolated from the pBS 
intermediate vector shown in FIG. 1. 

FIGS. 5A-B. Induction of hGC: FLAG 1 " fusion protein in 
transgenic tobacco plants as detected by Western analysis 
using anti-FLAG™ antibodies and anti-hGC antibodies. Leaf 
35 tissue from X-ll was induced by MGA at time 0 at room 

temperature, harvested at 2, 4, 8, 16 , and 24 hrs # and frozen 
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at -20°C prior to extraction. hGC : FLAG™ was solubilized by 
grinding the tissue in a coffee bean grinder with dry ice and 
homogenized in 1% Triton X-100, 1% taurocholate , 25 mM sodium 
citrate pH 7.0, 4 mM 3-mercaptoethanol , and 5 mM 
5 ethylenediaminetetraacetic acid (EDTA) , followed by two 
cycles of freezing and thawing of the homogenate. Both 
protein concentration and enzyme activity of cell free 
extracts were determined. FIG. 5A. Ten fig of total soluble 
protein were analyzed by Western immunoblot using anti-FLAG™ 

10 antibodies. Lane 1, 24 ng of FLAG™ -tagged control protein; 
lane 2, x-ll at time 0; lane 3, X-ll at 2 hr; lane 4, X-ll at 
4 hrs; lane 5, X-ll at 8 hrs; lane 6, X-ll at 12 hrs; lane 7, 
X-ll at 24 hrs; lane 8, UT (control plant) at 12 hrs. FIG. 
5B. Forty /ig of total soluble protein were analyzed by 

15 Western immunoblot using anti-hGC antibodies. Lane 1 , UT at 
time 0; lane 2, X-ll at time 0; lane 3, X-ll at 2 hrs; lane 
4, X-ll at 4 hrs; lane 5, X-ll at 8 hrs; lane 6, X-ll at 12 
hrs, lane 7 f X-ll at 24 hrs; lane 8, UT at 8 hrs. The 
maximum level of hGC: FLAG™ expression was found between 8-12 

2 0 hrs post-MGA. 

FIG. 6. Total B-glucosidase (endogenous plant /3- 
glucosidase and hGC) activity post-MGA of X-ll leaf tissue. 
One-tenth ^g of cell free extract was assayed for ability to 
convert the fluorometric substrate, 4-methylumbellif eryl 

2 5 -D-glucopyranoside (4MuGlc) to 4MU at 3 7 °C, as measured in a 

fluorometer (Hoefer DyNA Quant-200, Hoefer, Pharmacia, 
Biotech. Inc.) with excitation at 365 nm and emission at 460 
nm. FU = fluorometer units; Time = hrs post-induction (i.e., 
wounding of tissue or MGA) . 

3 0 FIGS. 7A-B. Affinity purification of hGC: FLAG™ fusion 

protein. FIG. 7A. Commassie blue stained SDS-PAGE gel and 
Western analysis of FLAG™ affinity-purified hGC: FLAG™. Lane 
1, Cooraassie blue stained SDS-PAGE gel of 0.1 Mg FLAG™ 
affinity-purified hGC: FLAG™; Lane 2, Western analysis using 
35 anti-hGC antibodies on 0.1 Mg FLAG™ affinity-purified 

hGC: FLAG™. FIG. 7B. Commassie blue stained SDS-PAGE gel and 
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Western analysis of ConA-af f inity-purif ied hGC : FLAG™ . Lane 
1, Coomassie blue stained SDS-PAGE gel of 10 nq of ConA 
purified hGC : FLAG™ ; Lane 2, Western analysis of ConA 
purified hGC: FLAG™ using anti-FLAG™ antibodies. These 
5 results indicate that the ConA-purif ied hGC: FLAG™ protein is 
glycosylated . 

FIG. 8. Immuno-slot blot Western analysis using 
ant i— FLAG™ antibodies on fractions from hGC: FLAG™ 
purification steps using plant tissue 12 hrs post-MGA. Lane 

10 A, FLAG™-tagged control protein: slot 1, 1 ng; slot 2, 6 ng; 
slot 3, 8 ng; slot 4, 18 ng; slot 5, 60 ng. Lane B, 
Fractions from isolation of hGC : FLAG™ : slot 1, 0.5 Ml/80,000 
Ml soluble protein from crude cell free extract; slot 2, 0.5 
Ml/80,000 Ml soluble protein from 33% ammonium sulfate (AS) 

15 supernatant; slot 3, 2.5 Ml/5,000 Ml soluble protein from 
ConA affinity-purified hGC : FLAG™ . Lane C: slot 1, 1 Ml 
soluble protein from crude plant tissue extract; slot 2, l Ml 
soluble protein from 33% AS supernatant; slot 3, 5 m1 soluble 
protein from ConA affinity-purified hGC: FLAG™. 

20 FIG. 9. Nucleotide sequence of hGC : FLAG™ construct (SEQ 

ID NO: 3) which was cloned and expressed in tobacco strains X- 
11 and X-27. The upper case underlined letters at three 
positions represent changes to the sequence in GENBANK (ATCC 
bank cDNA sequence) . The lower case letters represent 

25 additions to the hGC sequence , e.g., the FLAG™ epitope. 

FIG. 10. Deduced amino acid sequence of hGC: FLAG™ 
fusion protein (SEQ ID NO: 4) . The upper case underlined 
letters at two positions represent changes to the original 
hGC amino acid sequence disclosed by E. Neufled. Lower case 

30 letters represent additions to the hGC amino acid sequence. 
For example, dykddddk = the FLAG™ epitope. 

FIG. 11. Sequence of 456 bases comprising the MeGA 
promoter . 

FIG. 12. IDUA expression vector construction strategy. 
35 MeGA: IDUA and 355^: IDUA constructs were inserted into the 
Hindlll/SacI site of the binary vector pBIB-KAN. R and L 
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represent T-DNA right and left borders which precisely 
demarcate the DNA inserted into the plant genome, NPTII is 
the kanamycin selectable marker, pAnos is the 
polyadenylation/ terminator signal and Pnos a promoter from 
5 Agrobacterlum tumefaclens nopaline synthetase gene. 
PCR-primers for IDUA were: ID1, 

( 5 ' -CTAG tctaqa ATGCGTCCCCTGCGCCCCCGCG) ( SEQ ID NO: 6) and ID2 , 
( 5 ' G qaattcqaactc TCATGGATTGCCCGGGGATG) (SEQ ID NO: 7); IDUA 
sequences are capitalized, introduced restriction sites are 

10 underlined. SP, signal peptide; IDUA, human IDUA coding 
region; H, Hindlll; S, Sad; X, Xbal. 

FIGS. 13A-C. Transgenic tobacco expressing the 
MeGA: IDUA construct. Fig. 13A. Germination of first 
generation seeds on selective medium showing segregation of 

15 kanamycin resistant and sensitive seedlings. Fig. 13B. 

Young plants containing the MeGA: IDUA construct (right) and 
untransf orroed parent plants grown in parallel. Fig. 13C. 
Fully mature IDUA-expressing plants in the greenhouse. 

FIGS. 14A-B. Induction of IDUA transgene in tobacco 

2 0 leaf tissues. Leaf tissue from transgenic plant IDUA-9 was 
induced by excision into 1.5 mm strips and incubated at room 
temperature on moist paper towels in sealed plastic bag. 
Tissue was removed for analysis (stored at -80°C for RNA, 
-20°C for protein) at 0, 2,4, 8, 11, and 27 hrs 

25 post-induction. FIG. 14A. Northern blot analysis of IDUA 
mRNA from transgenic tobacco plants. Fifteen fiq of total RNA 
was run on glyoxal agarose gel, blotted onto nitrocellulose 
membrane, and hybridized with 32 P-labeled IDUA cDNA . FIG. 
14B. Western blot analysis of total soluble proteins (20 nq) 

30 from tobacco leaf extracts using antibodies to denatured IDUA 
synthesized in CHO cells. Control lane represents IDUA 
synthesized in CHO cells (98 kDa under our gel conditions) . 
IDUA synthesized from transgenic tobacco has a molecular size 
of 92 kDa. 

35 FIG. 15. Immunodetection of IDUA secreted by transgenic 

plants into the incubation buffer. Fifty fil of incubation 
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buffer was boiled and slotted onto OPTITRAN membrane along 
with control IDUA synthesized in CHO cells. Antibodies to 
denatured IDUA synthesized in CHO cells were used to detect 
IDUA. 

5 FIG. 16. IDUA activity in tissue extracts and 

incubation buffer from transgenic IDUA-9 plant tissue. Panel 
A: IDUA-9 plant tissue was induced and incubated in buffer, 
which was collected and replaced at various times after 
induction as described in the text. Open boxes represent 

10 IDUA activity in extracts prepared from induced tissue after 
incubation in buffer. Shaded boxes represent the IDUA 
activity in the incubation buffer. Panel B: IDUA-9 plant 
issue was induced and incubated without buffer for 3 4 hours 
after which an extract was prepared from the induced tissue. 

15 The IDUA activity of the extract is shown. 

FIG. 17. Comparison of IDUA activity in transgenic 
tobacco plants IDUA-7 , IDUA-8 and IDUA-9: Panel A: Plant 
tissue was induced and incubated in buffer, which was 
collected and replaced at various times after induction as 

20 described in the text. IDUA activity present in the 

incubation buffer collected at various times post-inducton 
was plotted. Panel B: Plant tissue was induced and incubated 
without buffer absence of incubation buffer for 34 hours, 
after which extracts were prepared from the induced tissues. 

2 5 The IDUA activities of the extracts are shown. 

FIG. 18. Western slot blot analysis of secreted IDUA 
from transgenic plant IDUA-9 after three sequential addition 
and collection of incubation buffer; 24, 26 and 34 hrs post- 
MGA. The tissue (1.5 gm) was induced and incubated in a 

30 moist plastic bag for 24 hrs. Ten ml of incubation buffer 
was used to wash the tissue; this fraction is denoted as 24 
hrs. Fresh buffer (10 ml) was added and incubated at room 
temperature for 2 hrs; this fraction was denoted as 26 hrs. 
Fresh buffer (10 ml) was added to the tissue and incubated 

35 for 8 hrs and this fraction was denoted as 34 hrs. Fifty mL 
of incubation buffer from each fraction was boiled and 
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slotted onto OPTITRAN membrane and analyzed with anti-IDUA 
antibodies. 

FIG. 19. The nucleotide sequence of the IDUA coding 
sequence used in the MeGA : IDUA and 35S ENH :IDUA expression 
5 construct. 

FIG. 20. The deduced amino acid sequence of the IDUA 

coding sequence shown in FIG. 19. 

FIG. 21. hGC cDNA plant expression construct and 

transformation vector. The MeGA: hGC expression construct in 
10 a pBS intermediate plasmid is excised and inserted into the 

SstI site of the binary plant transformation vector pBIB-KAN 

to form transformation vector pCT50. The PCR-amplif ication 

primers for reconstruction of the 3' end of the hGC coding 

region were: GC2 3, which has the sequence 
15 5 ' GCCTATGCTGAGCACAAGTTACAG3 ' (SEQ ID NO: 11); and GC3 7 r whose 

complementary strand has the sequence 

5 ' TTCCTTGAGCTCGTCACTGGCGACGCCACAGGTA3 ' (SEQ ID NO: 12). The 
other abbreviations and notations shown are same as those 
described for FIG. 1- 

20 

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to the production of 
recombinant human or animal lysosomal enzymes in plants and 
in cultured plant cells and plant tissues , involving: (1) 

25 construction of recombinant expression constructs comprising 
lysosomal enzyme coding sequences and transformation vectors 
containing the expression constructs; (2) transforming or 
transfecting plant cells, plant tissues or plants with the 
transformation vectors; (3) expressing the lysosomal enzyme 

30 coding sequences in the plant cell, plant tissue or plant; 
and (4) detecting and purifying expression products having 
lysosomal enzyme activity. 

The plant expression systems and the recombinant 
lysosomal enzymes produced therewith have a variety of uses, 

35 including but not limited to: (l) the production of 

enzymatically active enzymes for the treatment of lysosomal 
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storage diseases; (2) the production of antibodies against 
lysosomal enzymes, which antibodies would have medical 
diagnostic uses; (3) use in any commercial process that 
involves substrate hydrolysis; and (4) the production of 
5 modified proteins or peptide fragments to serve as precursors 
or substrates for further in vivo or in vitro processing to a 
specialized industrial form for research or therapeutic uses, 
such as to produce a therapeutic enzyme with increased 
efficacy or altered substrate specificity. These plant - 
10 expressed recombinant lysosomal protein products need not be 
enzymatically active or identical in structure to the 
corresponding native animal or human lysosomal enzymes or 
proteins in order to be useful for research or industrial 
applications . 

15 The terms "lysosomal enzyme" and "lysosomal enzyme gene 

product," as used herein with respect to any such enzyme and 
product produced in a plant expression system, refer to a 
recombinant peptide expressed in a transgenic plant or plant 
cell from a nucleotide sequence encoding a human or animal 

2 0 lysosomal enzyme, a modified human or animal lysosomal 

enzyme, or a fragment, derivative or modification of such 
enzyme. Useful modified human or animal lysosomal enzymes 
include but are not limited to human or animal lysosomal 
enzymes having one or several naturally-occuring or 

25 artifically- introduced amino acid additions, deletions and/or 
substitutions . 

The term "lysosomal enzyme coding sequence," as used 
herein, refers to a DNA or RNA sequence that encodes a 
protein or peptide, or a fragment, derivative or other 

30 modification thereof, which exhibits detectable enzymatic 
activity against a lysosomal enzyme substrate. 

The term "enzymatically active" is used herein with 
respect to any recombinant lysosomal enzyme produced in a 
plant expression system to mean that the recombinant 

35 lysosomal enzyme is able to hydrolyze either the natural 

substrate, or an analogue or synthetic substrate thereof of 
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the corresponding human or animal lysosomal enzyme, at 
detectable levels . 

The term "enzymatically active" is also used herein with 
respect to recombinant hGC and modified hGC produced in a 
5 plant expression system to mean that such hGCs are able to 
hydrolyze the native hGC substrate, i.e., 

N-acyl-shingosyl-l-o-B- D-glucoside, of the hGC or that it 
can cleave the synthetic B-glucoside, 

4-methyl-umbelliferyl-B-D-glucoside (4MuGlc) , at detectable 

10 levels. Similarly, the term as applied to plant-produced 
IDUA and modified IDUA means that such IDUAs are able to 
hydrolyze the native IDUA substrate, i.e., dermatan sulfate 
or heparan sulfate, or is able to cleave the synthetic a- 
glucoside, 4-methylumbellif eryl-a-L-iduronide (4-MUI) , at 

15 detectable levels. 

The term "transformant" as used herein refers to a 
plant, plant cell or plant tissue to which a gene construct 
comprising a lysosomal enzyme coding sequence has been 
introduced by a method other than transfection with an 

20 engineered virus. 

The term "transf ectant" refers to a plant, plant cell or 
plant tissue that has been infected with an engineered virus 
and stably maintains said virus in the infected cell. 

Once a plant transformant or transfectant is identified 

25 that expresses a recombinant lysosomal enzyme, one 

non-limiting embodiment of the invention involves the clonal 
expansion and use of that transformant or transfectant in the 
production and purification of enzymatically active 
recombinant lysosomal enzyme. In another non-limiting 

30 embodiment of the invention, each new generation of progeny 
plants may be newly screened for the presence of nucleotide 
sequence coding for a lysosomal enzyme, wherein such 
screening results in production by subsequent generations of 
plants of recoverable amounts of active recombinant lysosomal 

35 enzyme, and wherefrom the enzyme is then purified. 
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The invention is divided into the following sections 
solely for the purpose of description: (a) genes or coding 
sequences for lysosomal enzymes involved in lysosomal storage 
diseases; (b) construction of recombinant expression 
5 constructs for expressing lysosomal enzyme coding sequences 
in plant cell; (c) construction of plant transformation 
vectors comprising the expression constructs; (d) 
transf ormation/transf ection of plants capable of translating 
and processing primary translation products in order to 
10 express an enzymatically active recombinant lysosomal enzyme; 
(e) identification and purification of the recombinant 
lysosomal enzyme so produced; (f) expansion of the number of 
transformed or transf ected plants; and (g) methods of 
therapeutically using the recombinant lysosomal enzyme. 

15 

5.1. GENES OR CODING SEQUENCES FOR ENZYMES 
INVOLVED IN LYSOSOMAL STORAGE DISEASES 

The recombinant lysosomal enzymes produced in accordance 

with this invention will have a variety of uses, probably the 

20 most significant being their use in enzyme replacement 

therapy for lysosomal storage diseases. These lysosomal 

enzymes include but are not limited to: 

a-N-acetylgalactosaminidase (Warner et al. , Biochem. Biophys. 
Res. Commun. , I990 f 173:13-19; acid lipase; aryl sulfatase A; 

25 aspartylglycosaminidase; ceramidase; a-L-fucosidase (de Wet 
et al . , 1984, DNA 3:437-447), a-galactosidase , 
6-galactosidase, galactosylceramidase, glucocerebrosidase, 
a-glucosidase, fc-glucuronidase , heparin N-sulfatase, 
B-hexosaminidase, iduronate sulfatase, a-L-iduronidase, 

3Q a-mannosidase, S-mannosidase, sialidase, and sphingo- 
myelinase. Of these enzymes, cDNAs have been cloned for 
a-N-acetylgalactosaminidase (Zhu & Goldstein, 199 3, Gene 
137:309-314); acid lipase (Amesis et al . , 1994, Eur. J. 
Biochem 219:905-914); a-galactosidase (Eng & Desnick, 1994, 

35 Hum Mutat. 3:103-111); human glucocerebrosidase (hGC) (Sorge 
et al . , 1985, supra); a-L-iduronidase (Scott et al . , 1991, 
Proc. Natl. Acad. Sci. USA 88:9695-9699); iduronate sulfatase 
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(Daniele et al • , 1993, Genomics 16:755-757); a-mannosidase 
(Schatzle et al . , 1992, J. Biol. Chem 267:4000-4007); and 
sialidase (Ferrari et aJ . , 1994, Glycobiology 4:2047-2052). 
The nucleic acid sequences encoding lysosomal enzymes 
5 which can be used in accordance with the invention include 
but are not limited to any nucleic acid sequence that encodes 
a lysosomal enzyme, modified lysosomal enzyme, or functional 
equivalent thereof, including but not limited to: (a) any 
nucleotide sequence that selectively hybridizes to the 

10 complement of a human or animal lysosomal enzyme coding 
sequence under stringent conditions, e.g., washing in 
O.lxSSC/0.1 % SDS at 68'C (Ausubel et al . , eds. , 1989, 
Current Protocols in Molecular Biology. Vol. I . Greene 
Publishing Associates, Inc. and John Wiley & Sons, Inc., New 

15 York, at page 2.10.3), and encodes a product homologous to 
the human or animal lysosomal enzyme; and/or (b) any 
nucleotide sequence that hybridizes to the complement of the 
human or animal lysosomal enzyme coding sequence under less 
stringent conditions, such as moderately stringent 

20 conditions, e.g., washing in 0.2xSSC/0.1 % SDS at 42 # C 
(Ausubel et al . , 1989, supra), yet which still encodes a 
homologous gene product that is enzymatically active; and (c) 
any nucleotide coding sequence that otherwise encodes a 
protein from any organism capable of hydrolyzing a human or 

25 animal lysosomal enzyme's native substrate or substrate 
analogue. 

The invention also includes but is not limited to: 
(a) DNA vectors that contain any of the foregoing nucleotide 
coding sequences and/or their complements; (b) DNA expression 

30 and transformation vectors that contain expression constructs 
comprising any of the foregoing nucleotide coding sequences 
operatively associated with a regulatory element that directs 
expression of the coding sequences in plant cells or plants; 
and (c) genetically engineered plant cells or plants that 

35 contain any of the foregoing coding sequences, operatively 
associated with a regulatory element that directs the 
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expression of the coding and/ or antisense sequences in the 
plant cell. As used herein, the term "regulatory element" 
includes but is not limited to inducible and non-inducible 
promoters, enhancers, operators and other elements known to 
5 those skilled in the art that drive and/or regulate gene 
expression. The invention also includes fragments, 
derivatives or other modifications of the DNA sequences 
described herein. 

10 5*2. TRANSFORMATION VECTORS TO DIRECT THE EXPRESSION 
OF LYSOSOMAL ENZYME CODING SEQUENCE 

5. 2.1. LYSOSOMAL ENZYME EXPRESSION CONSTRUCTS 

In order to express a lysosomal enzyme in a plant 
expression system, the lysosomal enzyme coding sequence is 

15 inserted into an appropriate expression construct and the 
expression construct is incorporated into a transformation 
vector for transfer into cells of the plant. The expression 
construct is preferably constructed so that the lysosomal 
enzyme coding sequence is operatively associated with one or 

2Q more regulatory elements, including, e.g., promoters and/or 
enhancers, necessary for transcription and translation of the 
lysosomal enzyme coding sequence. Methods to construct the 
expression constructs and transformation vectors include 
standard in vitro genetic recombination and manipulation. 

25 See, for example, the techniques described in Weissbach and 
Weissbach, 1988, Methods For Plant Molecular Biology , 
Academic Press, Chapters 26-28. 

Regulatory elements that may be used in the expression 
constructs include promoters which may be either heterologous 

3Q or homologous to the plant cell. The promoter may be a plant 
promoter or a non-plant promoter which is capable of driving 
high levels transcription of a linked sequence in plant cells 
and plants. Non- limiting examples of plant promoters that 
may be used effectively in practicing the invention include 

35 cauliflower mosaic virus (CaMV) 35S, rJbcS, the promoter for 
the chlorophyll a/b binding protein, AdhI , NOS and HMG2 , or 
modifications or derivatives thereof. The promoter may be 
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either constitutive or inducible. For example, and not by 
way of limitation, an inducible promoter can be a promoter 
that promotes expression or increased expression of the 
lysosomal enzyme nucleotide sequence after mechanical gene 
5 activation (MGA) of the plant, plant tissue or plant cell. 
One non-limiting example of such an MGA-inducible plant 
promoter is MeGA (described infra) . 

The expression constructs can be additionally modified 
according to methods known to those skilled in the art to 

10 enhance or optimize heterologous gene expression in plants 
and plant cells. Such modifications include but are not 
limited to mutating DKA regulatory elements to increase 
promoter strength or to alter the lysosomal enzyme coding 
sequence itself. Other modifications include deleting intron 

15 sequences or excess non-coding sequences from the 5 ' and/or 
3' ends of the lysosomal enzyme coding sequence in order to 
minimize sequence- or distance-associated negative effects on 
expression of hGC, e.g., by minimizing or eliminating message 
destabilizing sequences. 

20 The expression constructs may be further modified 

according to methods known to those skilled in the art to 
add, remove, or otherwise modify peptide signal sequences to 
alter signal peptide cleavage or to increase or change the 
targeting of the expressed lysosomal enzyme through the plant 

25 endomembrane system. For example, but not by way of 

limitation, the expression construct can be specifically 
engineered to target the lysosomal enzyme for secretion, or 
vacuolar localization, or retention in the endoplasmic 
reticulum (ER) . 

30 In one embodiment, the expression construct can be 

engineered to incorporate a nucleotide sequence that encodes 
a signal targeting the lysosomal enzyme to the plant vacuole. 
For example, and not by way of limitation, the N-terminal 14 3 
amino acid domain derived from the plant vacuolar protein, 

35 proaleurain (Holwerda et al . , 1992, supra; Holwerda et ai . , 
1990, supra) , may be engineered into the expression construct 
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to produce a signal peptide- lysosomal enzyme fusion product 
upon transcription and translation. The proaleurain signal 
peptide will direct the lysosomal enzyme to the plant cell 
vacuole, but is itself cleaved off during transit through the 
5 plant endomembrane system to generate the mature protein. 

In another non-limiting embodiment, a signal peptide may 
be engineered into the expression construct to direct the 
lysosomal enzyme to be secreted from the plant cell. For 
example, and not by way of limitation, the signal peptide of 
10 tobacco PR-1, which is a secreted pathogenesis-related 

protein (Cornelissen et al . , 1986, EMBO J. 5:37-40), can be 
engineered into the expression construct to direct the 
secretion of the lysosomal enzyme from the plant cell. 

In an additional non-limiting embodiment, the signal 
15 peptide may be engineered into the expression construct to 
direct the lysosomal enzyme to be retained within the ER. 
Such ER-retained lysosomal enzymes may exhibit altered, and 
perhaps preferable, glycosylation patterns as a result of 
failure of the peptide to progress through the Golgi 
20 apparatus, thus resulting in a lack of subseguent glycosyl 
processing. For example, and not by way of limitation, a 
nucleotide sequence can be engineered into the expression 
construct to result in fusion of the amino acid sequence 
KDEL, i.e., Lys-Asp-Glu-Leu, to the carboxyl -terminus of the 
25 lysosomal enzyme. The KDEL sequence results in retention of 
the lysosomal enzyme in the ER (Pfeffer and Rothman, 1987, 
Ann. Rev. Biochem. 56:829-852). 

Expression construct may be further modified according 
to methods known to those skilled in the art to add coding 
30 sequences that facilitate purification of the lysosomal 
enzyme. In one non-limiting embodiment, a nucleotide 
sequence coding for the target epitope of a monoclonal 
antibody may be engineered into the expression construct in 
operative association with the regulatory elements and 
35 situated so that the expressed epitope is fused to the 

lysosomal enzyme. For example, and not by way of limitation, 
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a nucleotide sequence coding for the FLAG™ epitope tag 
(International Biotechnologies, Inc. , IBI) , which is a 
hydrophilic marker peptide , can be inserted by standard 
techniques into the expression construct at a point 
5 corresponding to the carboxyl -terminus of the lysosomal 

enzyme. The expressed FLAG™ epitope- lysosomal enzyme fusion 
product may then be detected and affinity-purified using 
anti-FLAG™ antibodies. 

In another non-limiting embodiment, a nucleotide 

10 sequence can be engineered into the expression construct to 
provide for a cleavable linker sequence between the lysosomal 
enzyme peptide sequence and any targeting signal, reporter 
peptide, selectable marker, or detectable marker, as 
described supra, that has not otherwise been cleaved from the 

15 lysosomal enzyme peptide sequence during peptide processing 
and trafficking through the plant endomembrane system. Such 
a linker sequence can be selected so that it can be cleaved 
either chemically or enzymatically during purification of the 
lysosomal enzyme (Light et al . , 1980, Anal. Biochem. 106:199- 

20 206) . 

5.2.2. PLANT TRANSFORMATION VECTORS 

The transformation vectors of the invention may be 
developed from any plant transformation vector known in the 

25 art include, but are not limited to, the well-known family of 
Ti plasmids from Agrobacterlum and derivatives thereof, 
including both integrative and binary vectors, and including 
but not limited to pBIB-KAN, pGA471, pEND4K, pGV3850, and 
pMON505. Also included are DNA and RNA plant viruses, 

30 including but not limited to CaMV, geminiviruses , tobacco 
mosaic virus, and derivatives engineered therefrom, any of 
which can effectively serve as vectors to transfer a 
lysosomal enzyme coding sequence, or functional equivalent 
thereof, with associated regulatory elements, into plant 

35 cells and/or autonomously maintain the transferred sequence. 
In addition, transposable elements may be utilized in 



-24- 



WO 97/10353 



PCTAJS96/14730 



conjunction with any vector to transfer the coding sequence 
and regulatory sequence into a plant cell. 

To aid in the selection of transf ormants and 
transfectants, the transformation vectors may preferably be 
5 modified to comprise a coding sequence for a reporter gene 
product or selectable marker. Such a coding sequence for a 
reporter or selectable marker should preferably be in 
operative association with the regulatory element coding 
sequence described supra, 

10 Reporter genes which may be useful in the invention 

include but are not limited to the 6-glucuronidase (GUS) gene 
(Jefferson et al . , 1986, Proc. Natl. Acad. Sci. USA, 
83:8447), and the luciferase gene (Ow et al . , 1986, Science 
234:856). Coding sequences that encode selectable markers 

15 which may be useful in the invention include but are not 
limited to those sequences that encode gene products 
conferring resistance to antibiotics, anti-metabolites or 
herbicides, including but not limited to kanamycin, 
hygromycin , streptomycin , phosphinothricin , gentamicin , 

20 methotrexate, glyphosate and sulfonylurea herbicides, and 
include but are not limited to coding sequences that encode 
enzymes such as neomycin phosphotransferase II (NPTII) , 
chloramphenicol acetyltransf erase (CAT) , and hygromycin 
phosphotransferase I (HPT, HYG) . 

25 

5,3. TRAK8FORMATION/TRMJ8FECTIOK OF PLANTS 

A variety of plant expression systems may be utilized to 
express the lysosomal enzyme coding sequence or its 
functional equivalent. Particular plant species may be 

30 selected from any dicotyledonous, monocotyledonous species, 
gymnospermous , lower vascular or non-vascular plant, 
including any cereal crop or other agriculturally important 
crop. Such plants include, but are not limited to, alfalfa, 
Arabldopsis, asparagus, barley, cabbage, carrot, celery, 

35 corn, cotton, cucumber, flax, lettuce, oil seed rape, pear, 



-25- 



WO 97/10353 



PCT7US96/14730 



peas, petunia, poplar, potato, rice, soybean, sugar beet, 
sunflower, tobacco, tomato, wheat and white clover. 

Methods by which plants may be transformed or 
transfected are well-known to those skilled in the art. See, 
5 for example, Plant Biotechnology . 1989, Kung & Arntzen, eds., 
Butterworth Publishers, ch. 1, 2. Examples of transformation 
methods which may be effectively used in the invention 
include but are not limited to Agrobacterium -media ted 
transformation of leaf discs or other plant tissues, 

10 microinjection of DNA directly into plant cells, 

electroporation of DNA into plant cell protoplasts, liposome 
or spheroplast fusion, microprojectile bombardment, and the 
transfection of plant cells or tissues with appropriately 
engineered plant viruses. 

15 Plant tissue culture procedures necessary to practice 

the invention are well-known to those skilled in the art. 
See, for example, Dixon, 1985, Plant Cell Culture; A 
Practical Approach , IRL Press. Those tissue culture 
procedures that may be used effectively to practice the 

2 0 invention include the production and culture of plant 

protoplasts and cell suspensions, sterile culture propagation 
of leaf discs or other plant tissues on media containing 
engineered strains of transforming agents such as, for 
example, Agrobacterium or plant virus strains and the 

25 regeneration of whole transformed plants from protoplasts, 
cell suspensions and callus tissues. 

The invention may be practiced by transforming or 
transfecting a plant or plant cell with a transformation 
vector containing an expression construct comprising a coding 

30 sequence for the lysosomal enzyme and selecting for 

transf ormants or transf ectants that express the lysosomal 
enzyme. Transformed or transfected plant cells and tissues 
may be selected by techniques well-known to those of skill in 
the art, including but not limited to detecting reporter gene 

3 5 products or selecting based on the presence of one of the 

selectable markers described supra. The transformed or 
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transfected plant cells or tissues are then grown and whole 
plants regenerated therefrom. Integration and maintenance of 
the lysosomal enzyme coding sequence in the plant genome can 
be confirmed by standard techniques, e.g., by Southern 
5 hybridization analysis, PCR analysis, including reverse 

transcriptase-PCR (RT-PCR) , or immunological assays for the 
expected protein products. Once such a plant transformant or 
transfectant is identified, a non-limiting embodiment of the 
invention involves the clonal expansion and use of that 
10 transformant or transfectant in the production of lysosomal 
enzyme. 

As one non-limiting example of a transformation 
procedure, AgroJbacteriujn-mediated transformation of plant 
leaf disks can follow procedures that are well known to those 

15 skilled in the art. Briefly, leaf disks can be excised from 
axenically grown plant seedlings, incubated in a bacterial 
suspension, for example, 10* cfu/ml, of A. tumefaciens 
containing an engineered plasmid comprising a selectable 
marker such as, for example, kanamycin resistance, and 

20 transferred to selective "shooting" medium containing, for 
example, kanamycin, that will block growth of bacteria and 
untransf ormed plant cells and induce shoot initiation and 
leaf formation from transformed cells. Shoots are 
regenerated and then transferred to selective media to 

25 trigger root initiation. Stringent antibiotic selection at 
the rooting step is useful to permit only stably transformed 
shoots to generate roots. Small transgenic plantlets may 
then be transferred to sterile peat, vermiculite, or soil and 
gradually hardened off for growth in the greenhouse or in the 

30 field. 

5.4. IDENTIFICATION AND PURIFICATION OF 
THE LYSOSOMAL ENZYME GENE PRODUCT 

Transcription of the lysosomal enzyme coding sequence 

35 and production of the lysosomal enzyme in transformed or 

transfected plants, plant tissues, or plant cells can be 

confirmed and characterized by a variety of methods known to 
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those of skill in the art. Transcription of the lysosomal 
enzyme coding sequence can be analyzed by standard 
techniques, including but not limited to detecting the 
presence of lysosomal enzyme messenger ribonucleic acid 
5 (mFNA) transcripts in transformed or transfected plants or 
plant cells using Northern hybridization analysis or RT-PCR 
amplification. 

Detection of the lysosomal enzyme itself can be carried 
out using any of a variety of standard techniques, including, 

10 but not limited to, detecting lysosomal enzyme activity in 
plant extracts, e.g., by detecting hydrolysis either of the 
enzyme's natural substrate or a substrate analogue. 
Additionally, the lysosomal enzyme can be detected 
immunologically using monoclonal or polyclonal antibodies, or 

15 immuno-reactive fragments or derivatives thereof, raised 
against the enzyme, e.g. , by Western blot analysis, and 
limited amino acid sequence determination of the protein. 

Indirect identification of enzyme production in a plant 
can be performed using any detectable marker or reporter 

2 0 linked to the lysosomal enzyme. For example, but not by way 
of limitation, the FLAG™ epitope, which can be linked to the 
lysosomal enzyme, as described supra, is detectable in plant 
tissues and extracts using anti-FLAG M2 monoclonal antibodies 
(IBI) in conjunction with the Western Exposure™ 

2 5 chemi-luminescent detection system (Clontech) . 

Lysosomal enzyme production in a transformed or 
transfected plant can be confirmed and further characterized 
by histochemical localization, the methods of which are 
well-known to those skilled in the art.. See, for example, 

30 Techniques in Immunocvtochemistrv . Vol I . 1982, Bullock and 
Petrusz, eds., Academic Press, Inc. For example, but not by 
way of limitation, either fresh, frozen, or fixed and 
embedded tissue can be sectioned, and the sections probed 
with either polyclonal or monoclonal primary antibodips 

35 raised against the lysosomal enzyme or, for example, anti- 
FLAG™ monoclonal antibodies. The primary antibodies can then 
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10 



15 



be detected by standard techniques, e.g., using the 
biotinylated protein A-alkaline phosphatase-conjugated 
streptavidin technique, or a secondary antibody bearing a 
detectable label that binds to the primary antibody. 
5 The expression products can be further purified and 

characterized as described in the subsections below. 

5, 4.1. PRODUCTION AND PURIFICATION OP THE 

LYSOSOMAL ENZYME GENE PRODUCT 

One non- limiting method to produce and purify the 

lysosomal enzyme is described here, wherein the lysosomal 

enzyme coding sequence is operably associated with an 

inducible promoter in the expression construct. Leaf or 

other tissue or cells from a transgenic plant or cell culture 

transformed or transfected with this expression construct can 

be processed to induce expression of the lysosomal enzyme 

coding sequence. This induction process may include inducing 

the activation of lysosomal genes by one or more methods, 

applied separately or in combination, including but not 

limited to physical wounding or other mechanical gene 

activation (MGA) , and application of chemical or pathogenic 

elicitors or plant hormones. Lysosomal gene activation 

levels may also be enhanced in plant cells or tissues by 

factors such as the availability of nutrients, gases such as 

0 2 and C0 2 , and light or heat. After induction of expression, 

the tissue can be stored, e.g., at -20°C. If the lysosomal 

protein is targeted for localization within the plant cell, 

the plant cell wall must be penetrated to extract the 

protein. Accordingly, the plant tissue can be ground to a 

fine powder, e.g., by using a tissue grinder and dry ice, or 

homogenized with a ground glass tissue homogenizer. To 

resuspend the lysosomal enzyme, plant membranes must be 

solubilized using an extraction buffer containing a 

detergent, e.g., a bile detergent such as 1% (w/v) sodium 

taurocholate, in a buffered solution, e.g., 25 mM sodium 

citrate, pH 7.0. The homogenate can then be clarified by, 
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for example, centrif ugat ion at 10 , 000 x g for 3 0 min to 
produce a cell-free homogenate . 

The lysosomal enzyme must be further purified if it is 
to be useful as a therapeutic or research reagent. The 
5 lysosomal enzyme can be purified from plant extracts 

according to methods well-known to those of skill in the art 
(Furbish et al . , 1977, Proc. Natl. Acad. Sci. USA 74:3560- 
3563). Once the presence of the enzyme is confirmed it can 
be isolated from plant extracts by standard biochemical 

10 techniques including, but not limited to, differential 
ammonium sulfate (AS) precipitation, gel filtration 
chromatography or affinity chromatography, e.g., utilizing 
hydrophobic, immunological or lectin binding. At each step 
of the purification process the yield, purity and activity of 

15 the enzyme can be determined by one or more biochemical 
assays, including but not limited to: (l) detecting 
hydrolysis of the enzyme's substrate or a substrate analogue; 
(2) immunological analysis by use of an enzyme-linked 
immunosorbent assay (ELISA) ; (3) sodium dodecyl sulfate- 

20 polyacrylamide gel electrophoresis (SDS-PAGE) analysis; and 
(4) Western analysis. The enzyme may be alternatively or 
additionally purified by affinity chromatography wherein the 
enzyme binds to its inhibitor which is linked, for example, 
to an inert substrate. 

25 Once solubilized, all enzyme-containing fractions can be 

maintained, for example, by storage at 4°c, and stabilized if 
necessary, e.g., with 4 mM B-mercaptoethanol , 5 mM EDTA, 
and/or possibly with high levels of glycerol or ethylene 
glycol. 

30 

5.4.2. PROTEOLYTIC PROCESSING OF THE SIGNAL 
PEPTIDE 

In order to address whether the plant expression system 

efficiently recognizes and correctly cleaves the human signal 

peptide from the lysosomal enzyme, the plant-produced enzyme 

can be purified and analyzed by N-terminal sequencing. 

Accordingly, the enzyme can, for example, be treated with 
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Endo-F/N-glucanase (Boehringer Mannheim) to remove N-linked 
glycans, and the resulting peptide can be repurified by 
methods described supra. The purity of the enzyme can be 
determined based, for example, on silver-stained SDS-PAGE. 
5 The band containing the enzyme can be excised from the gel, 
the peptide eluted therefrom, and then analyzed by commercial 
N-terminal amino acid sequencing to determine whether the 
correct cleavage of the signal peptide has occurred. 
Incomplete cleavage can be detected, for example, as a double 
10 band on SDS-PAGE, or as mixed N-terminal sequences. 

5.4.3. N-LINKED GLYCOSYLATION IN PLANTS VERSUS 
ANIMALS 

The oligosaccharides of native human and animal 

15 lysosomal enzymes are typical antennary structures containing 
N-acetylglucosamine, mannose, and sialic acid. The 
glycoconjugate associated with the lysosomal enzyme of the 
invention may be determined, for example, by lectin binding 
studies (Reddy et al . , 1985, Biochem. Med. 33:200-210, 

20 Cummings, 1994, Meth. Enzymol. 230:66-86). 

Plant glycans do not contain sialic acid, which is a 
prevalent terminal sugar in mammalian glycans. In addition, 
the complex glycans of plants are generally smaller and 
contain a B 1-2 xylose residue attached to the B-linked 

25 mannose residues of the core (Gomez and Chrispeels, 1994, 
Proc. Natl. Acad. Sci. USA 91:1829-1833). 

Determination of the glycan composition and structure of 
the lysosomal enzyme of the invention is of particular 
interest because: (a) the glycan composition will indicate 

30 the status of the protein's movement through the Golgi; and 
(b) the presence of a complex glycan may indicate whether an 
antigenic response will be triggered in humans. 

Several molecular, genetic and chemical approaches can 
be used to raise the proportion of the high-mannose form of 

35 glycans on lysosomal enzymes, making them more similar in 
structure to the native human protein (Grabowski et al . , 
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1995, Ann. Int. Med. 122:33-39; Berg-Fussman et al . , 1993, 
J. Biol. Chem. 268:14861-14866). For example, but not by way 
of limitation, the mannose analog, l-deoxymanno j irimycin 
(dMM) , inhibits mannosidase I, the first Golgi-specif ic 
5 enzyme involved in glycan processing. Plant tissues treated 
with dMM produce glycoproteins which lack fucose and xylose 
and maintain a glycan profile consistent with inhibition at 
the mannosidase I step (Vitale et al . , 1989, PI. Phys. 
89:1079-1084). Treatment of lysosomal enzyme-expressing 
10 plant tissues with dMM may be useful to produce lysosomal 
enzymes with a relatively homogeneous high-mannose glycan 
profile. Such lysosomal enzymes should be highly effective 
for use in treatment of lysosomal storage diseases in human 
and animals. 

IS 

5.5. CLONAL PROPAGATION AND BREEDING OF TRANSGENIC 
PLANTS 

Once a transformed or transfected plant is selected that 

produces a useful amount of the recombinant lysosomal enzyme 

2Q of the invention, one embodiment of the invention 

contemplates the production of clones of this plant either by 
well-known asexual reproductive methods or by standard plant 
tissue culture methods. For example, tissues from a plant of 
interest can be induced to form genetically identical plants 

25 from asexual cuttings. Alternatively, callus tissue and/or 
cell suspensions can be produced from such a plant and 
subcultured. An increased number of plants can subsequently 
be regenerated therefrom by transfer to the appropriate 
regenerative culture medium. 

30 Alternatively, the recombinant lysosomal enzyme- 

producing plant may be crossed as a parental line, either 
male or female, with another plant of the same species or 
variety, which other plant may or may not also be transgenic 
for the lysosomal coding sequence, to produce an Fl 

35 generation. Members of the Fl and subsequent generations can 
be tested, as described supra, for the stable inheritance and 
maintenance of the lysosomal enzyme coding sequence, as well 
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as for lysosomal enzyme production. A breeding program is 
thus contemplated whereby the lysosomal enzyme coding 
sequence may be transferred into other plant strains or 
varieties having advantageous agronomic characteristics, for 
5 example, by a program of controlled backcrossing . The 
invention thus encompasses parental lines comprising the 
lysosomal enzyme coding sequence, as well as all plants in 
subsequent generations descending from a cross in which at 
least one of the parents comprised the lysosomal enzyme 

10 coding sequence. The invention further encompasses all seeds 
comprising the lysosomal enzyme coding sequence and from 
which such plants can be grown, and tissue cultures, 
including callus tissues, cell suspensions and protoplasts, 
comprising the lysosomal enzyme coding sequence, whether or 

15 not they can be regenerated back to plants. 

5.6. METHODS FOR THERAPEUTIC USE OF LYSOSOMAL ENZYMES 

The recombinant lysosomal enzymes of the invention are 
useful for therapeutic treatment of lysosomal storage 

2 0 diseases by providing a therapeutic amount of a particular 

lysosomal enzyme, or a derivative or modification thereof, to 
a patient suffering from a lysosomal storage disease or 
condition resulting from a deficiency of the corresponding 
human or animal active form of that enzyme. 

25 By "therapeutic amount" is meant an amount of 

enzymatically active lysosomal enzyme which will cause 
significant alleviation of clinical symptoms of a particular 
lysosomal storage disease. 

A therapeutic amount causes "significant alleviation of 

30 clinical symptoms" of the particular lysosomal storage 
disease if it serves to reduce one or more of the 
pathological effects or symptoms of the disease or to reduce 
the rate of progression of one or more of such pathological 
effects or symptoms. 

35 An effective dosage and treatment protocol may be 

determined by conventional means, starting with a low dose in 
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laboratory animals and then increasing the dosage while 
monitoring the effects, and systematically varying the dosage 
regimen as well. The amount of recombinant lysosomal enzyme 
to be administered to a patient suffering from a lysosomal 
5 disease or condition will vary. Numerous factors may be 
taken into consideration by a clinician when determining an 
optimal dose for a given subject. These factors include the 
size of the patient, the age of the patient, the general 
condition of the patient, the particular disease being 

10 treated, the severity of the disease, the presence of other 
drugs in the patient, and the like. Trial dosages would be 
chosen after consideration of the results of animal studies, 
and any available clinical literature with respect to past 
results of replacement therapy for the particular lysosomal 

15 storage disease. 

For example, therapeutic amounts of recombinant hGC and 
IDUA and modified hGC and IDUA produced according to the 
invention may in each instance encompass dosages of between 
about 10 and about 500 mg per 7 0 kg patient per month, 

20 depending upon the severity of the patient's symptoms of the 
Gaucher 's or Hurler's disease. 

The amount of recombinant lysosomal enzyme of the 
invention administered to the patient may be decreased or 
increased according to the enzymatic activity of the 

25 particular lysosomal enzyme. For example, administration of 
a recombinant lysosomal enzyme of the invention which has 
been modified to have increased enzymatic activity relative 
to the native human or animal enzyme will require 
administration of a lesser amount to the patient than a 

3 0 native human or animal lysosomal enzyme having lower 
enzymatic activity. 

In addition, the amount of recombinant lysosomal enzyme 
administered to the patient may be modified over time 
depending on a change in the condition of the patient as 

35 treatment progresses, the determination of which is within 
the skill of the attending clinician. 
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The invention also provides pharmaceutical formulations 
for use of the recombinant lysosomal enzyme in treating 
lysosomal storage diseases. The formulations comprise a 
recombinant lysosomal enzyme of the invention and a 
5 pharmaceutical^ acceptable carrier. A variety of aqueous 
carriers may be used, e.g., water, buffered water, 0.4% 
saline , 0.3% glycine, and the like. The pharmaceutical 
formulations may also comprise additional components that 
serve to extend the shelf-life of pharmaceutical 

10 formulations, including preservatives, protein stabilizers, 
and the like. The formulations are preferably sterile and 
free of particulate matter (for injectable forms) . These 
compositions may be sterilized by conventional, well-known 
sterilization techniques. 

15 The compositions may contain pharmaceutical ly acceptable 

auxiliary substances as required to approximate physiological 
conditions, such as pH adjusting and buffering agents, 
toxicity adjusting agents and the like, e.g., sodium acetate, 
sodium chloride, potassium chloride, calcium chloride, sodium 

2 0 lactate, etc. 

The formulations may be adapted for various forms of 
administration, including intramuscularly, subcutaneously , 
intravenously and the like. The subject formulations may 
also be formulated so as to provide for the sustained release 
25 of a lysosomal enzyme. Actual methods for preparing 

parenterally administrable compositions and adjustments 
necessary for administration to subjects will be known or 
apparent to those skilled in the art and are described in 
more detail in , for example, Remington's Pharmaceut ica 1 

3 0 Science. 17th Ed. . Mack Publishing Company, Easton, Pa. 

(1985) , which is incorporated herein by reference. 

The invention is illustrated in the working examples 
described Infra, for the expression of hGC in tobacco. 
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6. EXAMPLE 1: PRODUCTION AND ISOLATION OF RECOMBINANT 
MODIFIED hGC FROM TRANSGENIC TOBACCO PLANTS 

The subsections below describe the production of an 

enzymatically active modified human glucocerebrosidase (hGC) 

in tobacco. 

6.1. CONSTRUCTION OF A MODIFIED hGC EXPRESSION CONSTRUCT 
AND INSERTION INTO A PLANT TRANSFORMATION VECTOR 

6.1.1. PROMOTER: hGC EXPRESSION CONSTRUCT 

E. coll containing the hGC cDNA sequence cloned from 
fibroblast cells, as described (Sorge et al . , 1985, supra), 
was obtained from the ATCC (Accession No. 65696) . Oligo- 
nucleotide primers GC1 (corresponding to the amino terminus 
of the hGC coding region as shown in FIG. 1) , and GC4 
(corresponding to the carboxy terminus of the hGC coding 
region) , were used to amplify the hGC cDNA sequence using the 
polymerase chain reaction (PCR) . Primer GC1 was designed to 
include the hGC ATG initiation codon and to generate a 5 ' 
Xbal site. Primer GC4 , complementary to hGC mRNA, does not 
include the stop codon for the gene and was designed to 
generate an EcoRI restriction site. The design of 
oligonucleotide GC4 also corrected an altered base in the 
ATCC sequence (GenBank/EMBL #M11080) , thus producing an 
Arg-Arg-Gln sequence upstream to the site where a FLAG™ 
epitope will be inserted. 

The 1.9 kb fragment generated by PCR was purified by 
agarose gel elution, digested with Xbal and EcoRI , and 
ligated into the similarly digested plasmid, Bluescript SK" 
(Stratagene) . This cloning vector was chosen because of its 
small size (2.9 kb) and its extensive multiple cloning 
region. 

The MeGA promoter, comprising a 4 56 bp fragment (FIG. 
11) (SEQ ID NO: 5) as modified from the tomato HMG2 promoter 
(Weissenborn et al . , 1995, Phys. Plantarum 93:393-400), was 
used to drive the expression of the hGC gene. The MeGA 
promoter is inducible and has a low basal expression in 
unstressed plant tissues, but is highly induced in both 
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immature and mature tissues by the process of mechanical gene 
activation (MGA) , or by a variety of chemicals that induce 
plant defense responses. MGA includes but is not limited to 
the mechanical shredding of leaf tissue, for example, into 2 
5 mm strips, followed by storage at room temperature on Whatman 
3 MM chromatography paper moistened with sterile water in a 
sealed plastic bag. The expression of a MeG A : GUS construct 
has been monitored in transgenic tobacco plants from seedling 
stage to flowering and it showed no loss of inducible 

10 activity as plants reached maturity. 

The 456 bp MeGA promoter was PCR-amplif ied using primers 
which incorporated a NotI restriction site at the 5' end of 
the fragment and a Xbal site at the 3' end of the promoter. 
This fragment also contained the 5 ' -untranslated leader of 

15 its native tomato sequence and thus provided all necessary 5' 
elements for expression of the fused hGC sequences. 
Following amplification, the fragment was PAGE-purif ied, 
digested with NotI and Xbal, and ligated into the plasmid 
containing the hGC coding region, which had also been 

2 0 NotI /Xbal digested, to produce a MeGA: hGC fusion. 

6.1.2. GENERATION OF A MeGA g hGC i FLAG™ CONSTRUCT 

In order to facilitate detection and purification of the 
hGC gene product, a FLAG™ epitope coding sequence was fused 

25 in frame to the C-terroinus of the hGC coding sequence. The 
FLAG"" epitope (IBI) is the octapeptide Asp-Tyr-Lys-Asp-Asp- 
Asp-Asp-Lys (or DYKDDDDK) (SEQ ID NO: 10) designed to be a 
hydrophilic marker peptide situated on a protein surface to 
facilitate antibody interactions (Shelness, 1992, Epitope 

30 1:11-17; Hopp et al . , 1988, Bio/Tech. 6:1204-1210). 

A double-stranded oligonucleotide (FIG. 1) was 
synthesized which incorporated: (a) a 5' EcoRI restriction 
site which creates an in-frame fusion with the engineered hGC 
C-terminus EcoRI site; (b) the FLAG m octapeptide coding 

35 region; (c) a stop codon following the epitope; and (d) a 3' 
SstI/ EcoRI site. The DNA encoding FLAG™ was PAGE-purif ied, 
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digested with EcoRI , and the fragment encoding FLAG'" inserted 
into the EcoRI site of the MeGA : hGC plasmid, and tested for 
insert orientation. 

The translational fusion was tested by in vitro 
5 transcription using T3 RNA polymerase driven by the T3 

promoter in the pBluescript SK- vector following excision of 
the MeGA promoter, and in vitro translation in the presence 
of 35 S-methionine using rabbit reticulocyte lysates (BRL) . 
The major translation product was about 56-59 kDa, consistent 

10 with the expected size of the hGC: FLAG™ fusion product (59 
kDa) . In addition, the hGC: FLAG™ fusion construct was 
completely sequenced using the dideoxy-sequenase system 
(USB). The nucleotide sequence of the hGC: FLAG™ fusion (SEQ 
ID NO: 3) is shown in FIG. 9; the deduced amino acid sequence 

15 (SEQ ID NO: 4) is shown in FIG. 10. The construction altered 
amino acid residue 545 to an arginine (R) and added ten amino 
acid residues, including the FLAG™ octapeptide, to the 
car boxy terminal of hGC. See FIG. 10. 

2 0 6.1.3. INSERTION OF THE MeGA : hGC : FLAG™ CONSTRUCT 

INTO A PLANT TRANSFORMATION VECTOR 

The MeGA : hGC : FLAG™ expression construct was excised 
from the pBluescript vector by digestion with SstI and 
ligated into the corresponding restriction site in the 

2 5 multiple cloning region of the plant binary vector pBIB-KAN 
(Becker, 199 0 , Nucl. Acids Res. 18:203) to form plasmid 
CTProl : hGC : FLAG™ . As shown in FIG. 1, insertion of the 
MeGA: hGC: FLAG™ expression construct correctly positioned a 
plant transcriptional terminator for the construct. In 

30 addition, the binary vector carries an NPTII gene within the 
transfer DNA (T-DNA) which allows for selection of 
transformed plant cells based on kanamycin resistance. The 
engineered plasmid was transformed into E. coli strain DH5a 
and tested for correct insertion prior to mobilization into 

35 Agrobacterium tumefaciens strain LBA4404 (Hoekma et al . , 
1983, Nature 303:179-180). 
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6.2. INTRODUCTION OF THE MeGA ; hGC : FLAG™ EXPRESSION 

CONSTRUCT INTO TOBACCO AND ASSESSMENT OF hGClFLAG™ 
EXPRESSION _ 

6.2.1. GENERATION OF TRANSGENIC TOBACCO PLANTS 

CONTAINING THE MeGA : hGC I FLAG™ CONSTRUCT 

5 Agrobacterium-mediated transformation (Horsch et al . , 

1984, Science 22 3:496-498) was used to stably integrate the 

modified T-DNA sequence containing the MeGA : hGC : FLAG™ 

construct into the genome of tobacco. Leaf discs excised 

from aseptically grown seedlings of tobacco (Nlcotiana 

10 tabacum) cvs . Xanthi (a non-commercial variety) and VA116 (a 

commercial, flue-cured variety) were briefly incubated in a 

bacterial suspension (10 9 cfu/ml) of A. tumefaciens containing 

the engineered plasmid (FIG . 2A) , and co-cultivated on plates 

containing a nurse-culture of cultured tobacco cells for 48 

15 hr. The leaf discs were then transferred to MS media 

(Murashige & Skoog, 1962, Physiol. Plant. 15:47 3-497) 

containing 100 mg/L kanamycin and 9.12 /iM zeatin, which is a 

selective "shooting" medium that blocks the growth of 

bacteria and untransf ormed plant cells, and encourages shoot 

20 formation (Horsch et al . , supra). 

Shoots were observed three weeks post-inoculation (FIG. 

2B) and were excised and placed on selective rooting media 

(100 mg/L kanamycin, 10 (M indole-3-acetic acid in MS media). 

After 1 week, the rooted plantlets (FIG. 2C) were transferred 

25 to sterile potting soil and placed in the greenhouse (FIG. 

2D) . Additional shoots were excised and rooted over the next 

4 weeks with a total of 4 5 individual transf ormants being 

brought to soil (FIG. 2E) . The presence of the gene 

construct did not appear to have any effect on the growth or 

30 development of these transf ormants . 

6.2.2, SOUTHERN ANALYSIS OF MeGA : hGC S FLAG™ 

INSERTIONS IN TRANSGENIC PLANTS 

The stable insertion of the MeGA: hGC: FLAG™ construct was 

35 confirmed by genomic Southern hybridization analysis. Total 

DNA was isolated from leaf tissue of eight young regenerants 

and digested with Hindlll, which cuts only once within the 
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introduced DNA (see FIG. 1). The second Hindlll site flanks 
the introduced DNA and is located within the plant's genomic 
DNA, Thus, when probed with hGC cDNA sequences (1.7 kb 
Hindlll fragment from pBluescript intermediate vector) 3 ' of 
5 the Hindlll site, each fragment should be a distinctive size 
and represent an independent insertional event within the 
plant genome. 

Five of the eight putative transf ormants tested showed 
multiple hGC inserts (FIG. 3) . Four of these plants (X-l, 

10 X-8, X-9 and X-ll) were derived from the Xanthi cultivar. 
One plant (V-l) was derived from cultivar VA116. 
Transf ormant X-8 had less DNA loaded and showed two bands 
upon longer autoradiographic exposure. In addition, high 
levels of hGC were detected in other transf ormants for which 

15 Southern hybridizations were not carried out, including a 
plant designated X-27. 



6.2.3. NORTHERN ANALYSIS OF TRANSCRIPTIONAL 
ACTIVATION OF THE MeGA : hGC : FLAG™ 
TRANS GENE 

As described supra, the MeGA promoter is essentially 
inactive in unstressed leaves, but is activated by MGA (see 
FIG. 4) or by treatment with chemicals that induce plant 
defense responses. In order to demonstrate that transgenic 
plants express hGC: FLAG™ mRNA in the expected inducible 
expression pattern, transformed plant tissue was induced by 
MGA, i.e., by shredding the leaf tissue into 2 mm strips, 
followed by incubation of Whatman #1 paper moistened with 
sterile water within a ZipLoc™ plastic bag and incubated at 
room temperature for 24 hrs. Total RNA was isolated by 
standard guanidino-thiocyanate methods from leaf tissue of 
untransf ormed and transformed plants immediately upon 
excision (time 0) , or at 24 hr after MGA. 

As shown in FIG. 4, hGC: FLAG™ mRNA levels were 
undetectable in leaves of X-ll at the time 0, but showed a 
marked increase in hGC transcript levels 2 4 hr after MGA. A 
more detailed time course of a second plant, V-l, showed 
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detectable mRNA by 4 hr, maximal RNA levels at 24 hr, and 
mRNA levels declining at 48 hr. In addition, transcript 
levels increased in response to chemical defense elicitors 
compared to MGA. This pattern of expression is exactly that 
5 expected of a transgene construct linked to the MeGA promoter 
(Park et al • , 1992, PI. Mol. Biol. 20:327-331; Yang et al . , 
1991, PI. Cell 3:397-405). 
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6.2.4. IMMUNODETECTION OF THE hGC : FLAG™ PROTEIN 
IN TRANSGENIC PLANT EXTRACTS 

As described supra, the hGC: FLAG™ fusion construct was 
designed to utilize the FLAG™ epitope to facilitate detection 
and purification of the hGC: FLAG™ fusion protein. Seven 
weeks after plants were potted in soil, leaf discs from 35 
plants of the 45 transf ormants described above were harvested 
(and thereby wounded) to induce transgene expression. 
Extracts from the leaf discs of control plants and transgenic 
plants were spotted on nitrocellulose membranes for 
immuno-dot blot analysis. Monoclonal antibodies (anti-FLAG 
M2, IBI) against the FLAG™ epitope, in conjunction with the 
Western Exposure™ chemiluminescent detection system 
(Clontech, Inc.), were used to test for immuno- reactive 
material. Of the 35 plants tested, 25 showed significant 
transgene expression. 

Western analysis of extracts from wounded leaves of 
untransformed plants and transformed plants were tested for 
immuno-reactivity to polyclonal antibodies raised against hGC 
(FIG. 5B) . These antibodies have not shown binding to any 
mammalian proteins other than the acid 0-glucosidase , i.e., 
glucocerebrosidase of chimpanzees. Extracts from transgenic 
plants showed strong immuno-reactivity by a single protein 
band with an apparent molecular weight of about 66-69 kDa 
(FIG. 5B) . The size of the immuno-reactive protein was 
reduced to about 58 kDa after W-glucanase treatment, 
indicating that the enzyme was glycosylated. Analogous 
Western immunoblots probed with anti-FLAG™ antibodies showed 
additional similar molecular weight bands (FIG. 5A) , 
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suggesting that both the polyclonal antibody to hGC and the 
anti-FLAG T * antibody recognize the same fusion protein 
product . 

5 6.2.5. ENZYMATIC ACTIVITY IN TOBACCO EXTRACTS 

Plant tissues were tested for hGC activity using a 
sensitive and convenient assay that is widely utilized in 
Gaucher disease research (Grabowski et al . , 1990, in: 
Critical Reviews in Biochemistry and M olecular Biology . 

10 25:385-414, CRC Press, Inc.)- This assay uses the 
f luorometric substrate, 4-methylumbellif eryl-/3- 
D-glucopyranoside (4MuGlc) (the "4MuGlc assay"). An increase 
in absorbance at 4 60 nm results from cleavage of 4MuGlc, and 
indicates the presence of enzymatic activity. 4MuGlc also 

15 serves as a substrate for endogenous plant /3-glucosidases 
which have been detected in leaves of both control and 
transgenic plants. However, several distinctive properties 
of hGC were used to distinguish between endogenous 
glucosidase activity and hGC activity (TABLE 1) . The 

2 0 differences in solubility together with the use of anti-FLAG™ 
affinity system for purification of the hGC: FLAG™ were 
employed to solve the problem of separating hGC: FLAG"* from 
the endogenous plant B-glucosidases (Table 2, FIG. 8). 
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TABLE 1. Comparisons of endogenous tobacco 0-glucosidase and 
hGC : FLAG™ 





CHARACTERISTICS 


ENDOGENOUS 


hGC: FLAG™ 


c 


Solubility 


Present in soluble 
extract in 0.1% 
iriLun a iuu 
buffer 


Membrane- 
associated, 
requiring nign 
Triton 

concentration , 
sonication, or 
freeze/thaw to 
solubilize 


in 


Response to MGA 


High levels in 
unstressed leaves , 
declines approx. 
80% post-MGA 


Absent in 
unstressed leaves , 
induced 24-48 hrs 
post-MGA 


15 


Inhibition 


Weakly inhibited 
by conduritol B 
epoxide (CBE) 
(Sigma) 


Strongly inhibited 
by CBE 




Substrate 


Active with MuGlc 


Active with MuGlc 


20 


Antibody response 


No immuno- 
reactivity to 
anti-FLAG™ or 
anti-hGCase 
antibodies 


Immuno -re active to 
both ant i -FLAG™ 
and anti-hGCase 
antibodies 



6.2*6. ACCUMULATION OF hGC: FLAG™ PROTEIN IN 
TOBACCO TISSUES 

25 In order to determine the best length of incubation time 

post-MGA for optimum yield of hGC: FLAG™ protein and hGC 
enzyme activity, extracts were analyzed from transgenic 
leaves at 0, 2, 4, 8, 16, and 24 hrs post-MGA. Plant tissue 
(0.5 gm) was ground using dry ice and a coffee bean grinder. 

30 To solubilize hGC: FLAG™, the ground tissue was resuspended in 
1.0 ml of extraction buffer containing 25 mM sodium citrate 
pH 7.0 f 1% (w/v) sodium taurocholate , 4 mM 6-mercaptoethanol f 
and 5 mM EDTA . The homogenate was frozen in a dry 
ice/ethanol bath for 30 min and thawed at 4°C for 2 hrs. 

35 This freeze-thaw procedure was repeated. Cell debris was 
pelleted at 14,000 x g for 15 min. at 4°C. The cell free 
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supernatant was collected and brought up to 4 0% (v/v) 
glycerol in order inhibit the denaturation of hGC : FLAG™ 
protein. 

Western analysis was carried out on 10 fig of soluble 
5 protein from leaf extracts to test for immuno-reactivity to 
polyclonal antibodies raised against hGC (FIG. 5B) and 
monoclonal antibodies against the FLAG™ epitope (FIG. 5A) . 
The highest level of induction of hGC: FLAG™ protein occurred 
between 8 and 12 hrs post-MGA. 
10 To determine the optimum time post-MGA for obtaining the 

highest level of hGC enzymatic activity, 0.1 of leaf 
extracts were assayed using the 4MuGlc assay. The highest 
hGC activity was found in extracts from 12 hrs post-wounded 
tissue (FIG. 6) . 

15 

6.3. PURIFICATION OF hQC I FLAG™ FROM TOBACCO EXTRACTS 

Forty gms of post-wounded (12 hrs) tissue was ground to 
a fine powder using dry ice and a coffee bean grinder. One 
hundred mis of extract buffer were added and the sample was 

20 made into a slurry using a polytron (Brinkman Scientific) . 

The extract was frozen in a dry ice/ethanol bath for 1 hr and 
thawed for 16 hrs at 4°C. Cell debris was pelleted at 14,000 
x g for 3 0 min. The supernatant was filtered through 4 
layers of cheese cloth and the filtrate was saved. An 1 ml 

2 5 aliquot was stored in 40% (v/v) glycerol for later protein 
and hGC enzymatic activity determination, while ammonium 
sulfate (AS) was gradually added with stirring to the 
remaining filtrate to 33% (w/v) final concentration and 
incubated at 4°C for 1 hr. The homogenate was cleared by 

30 centrifugation at 14,000 x g for 30 min. The supernatant was 
dialyzed overnight at 4°C against the following buffer: 0.1 
M sodium citrate, pH 6.0, 4 mM 6-mercaptoethanol and 5 mM 
EDTA. The supernatant was clarified by centrifugation at 
14,000 x g for 30 min. The cleared supernatant was 

35 concentrated (Aroicon, YM30 filters) to a final volume of 5 
mis, and 0.5 ml of the concentrated AS supernatant was saved 
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for protein and hGC enzyme activity analysis. The hGC: FLAG™ 
in l ml of concentrated supernatant was purified by affinity 
chromatography using an anti-FLAG™ affinity column. 

To utilize the FLAG™ epitope for purification of the 
5 hGC: FLAG™ protein, 1 ml of leaf extract prepared as above was 
applied to a 1 ml anti-FLAG™ M2 affinity column. The column 
was previously equilibrated with phosphate-buffered saline 
(PBS; 50 mM, pH 6.4) containing 10% glycerol and 4 mM /3- 
mercapto-ethanol at 4°C. After several washes with PBS, the 

10 bound hGC: FLAG™ protein was eluted with three 1 ml aliquots 
of purified FLAG™ peptide (IBI) , i.e., 1 ml at 500 ng/ml, 
followed by 2 x 1 ml at 250 /ig/ml. Eluted material was 
slot-blotted onto a nitrocellulose membrane and tested for 
immuno-reactivity to the anti-FLAG™ M2 antibody, and analyzed 

15 by SDS-PAGE, and stained with Commassie blue to determine 
relative purity (FIG . 7A) . No immuno-reactive material was 
eluted in the first fraction since release of the bound 
hGC: FLAG™ protein requires equilibration with the peptide. 
As a consequence, the second and third eluted fractions 

20 contained the majority of immuno-reactive material. SDS-PAGE 
analysis of anti-FLAG™-purif ied hGC: FLAG™ protein showed a 
single band co-migrating with the anti-FLAG™ immuno-reactive 
protein (FIG. 7A) . 

In order to utilize the properties of the glycans 

25 present on the hGC: FLAG™ protein for purification purposes, 
hGC: FLAG™ protein was also isolated using a concanavalin-A 
(ConA) affinity column (Sigma) . Concentrated tissue extract 
(1.5 ml) was loaded onto a 1.5 ml bed volume of ConA in 
column buffer (0.1 M sodium citrate pH 6.5, 0.15 M sodium 

30 chloride) . An equal volume of column buffer was added to the 
concentrated extract and passed through the column twice at 
4°C. The ConA column was washed three times with column 
buffer using three times the bed volume of buffer. The bound 
hGC: FLAG™ was eluted with 5 mis of 0.1 M methyl 

3 5 a-D-mannopyranoside (Sigma) followed by 5 mis of l M methyl 
a-D-mannopyranoside. Fractions were collected and assayed 
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for protein content and hGC enzymatic activity. All 
fractions containing hGC enzyme activity were concentrated 
(Amicon, YM30 filters) to a final volume of 0.5 ml. To 
stabilize the hGC enzymatic activty of the hGC: FLAG™ protein, 
5 the concentrated extract was made 40% (v/v) in glycerol and 
stored at 4°C. SDS-PAGE analysis of the ConA purified 
hGC: FLAG™ protein (FIG. 7B) showed a band migrating at 66-69 
kd and three lower molecular weight bands that stained 
equally with Commassie blue. 
10 Enzyme activity and protein determination of fractions 

from each step in the purification indicate that the most 
effective method to purify hGC: FLAG™ was to employ anti-FLAG™ 
affinity chromatography followed by the ConA affinity 
chromatography (see Table 2 and FIGS. 7A-B) . 

15 

TABLE 2. PURIFICATION OF hGC: FLAG™ FROM TOBACCO EXTRACTS 



Fraction 


Protein Cone. Specific 
activity 
(nraole 4MU/min/pg/ml ) 


% Activity 
Recovered 


Fold 

Purf ication 


40 gin 9 FW 


2 rag/ml 


♦0.027 


100 


1 


33% AS-sup 


2 . 5 mg/ml 


*0.625 


180 


13 


ConA 


0.1 mg/ml 


+0.81 


12.5 


240 


FLAG 


7.2 vg/ml 


+0.84 


N.D. 


N.D. 



* Since 4MUGlc is not a specific substrate, this specific activity 
2 5 represents both plant glucosidase and hGC activity. 

+ Plant glucosidase does not bind to ConA or ant i— FLAG™* affinity columns 
(data not shown), therefore, this enzymatic activity is from hGC: FLAG 
alone. 



30 6.4. PRODUCTI ON OF hGC: FLAG™ PROTEIN FROM TOBACCO PLANTS 

An estimation can be made on the amount of hGC: FLAG™ 
extracted per gm fresh weight of tobacco plant tissue or per 
mg soluble protein from slot blot western analysis of initial 
crude extracts using anti-FLAG™. Approximately 2 rag/ml of 
35 soluble protein were extracted per 0.5 gm of fresh weight 
plant tissue. Western slot blot analysis of 1 /xl of crude 
extract indicates the presence of approximately 0.5 to 0.6 fxg 
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of hGC : FLAG™ (FIG. 8). Based on these results, a single 
mature tobacco plant comprising about 1.6 kg of fresh weight 
of tissue will contain about 2 . 5 gm of hGC: FLAG™ per plant. 
Accordingly, a standard acre of tobacco planted to 6,000 
5 plants could potentially produce 15 kg of hGC: FLAG™ (Table 
3) . 



TABLE 3. EXT PACT ABLE hGC: FLAG™ PER ACRE OF TOBACCO 



Tissue 


Soluble Protein 
Total 


Extractable 
hGC : FLAG™ 


*1 gm 


4 - 5 mg 


1.5 mg 


1.6 kg/plant 


6 - 8 gm 


2.4 gm 


6,000 PLANTS/ACRE 
(Standard field) 






9,600 kg 


38 - 48 kg 


14.4 kg 



* These estimations are based on slot blot westerns using 
anti-FLAG and crude extracts from 0.5 gm - 50 gm of post- 
wounded tissue. 



20 

7. EXAMPLE 2: PRODUCTION AND PURIFICATION OF IDUA IN 
TRANSGENIC TOBACCO PLANTS 

The subsections below describe the production of 

enzymatically active recombinant human a-L-iduronidase (IDUA) 

25 in transgenic tobacco plants. 

7.1. CONSTRUCTION OF A PLANT TRANSFORMATION VECTOR 
CONTAINING AN IDUA EXPRESSION CO NSTRUCT 

7.1.1. IDUA EXPRESSION CONSTRUCT 

30 The first step in the construction of the desired plant 

transformation vector was to generate the human IDUA coding 
region with appropriate flanking restriction site to 
facilitate fusion to specific plant promoters and insertion 
into plant transformation vectors. A full-length human IDUA 

35 cDNA clone was provided by E. Neufeld (University of 

California, Los Angeles) . In this clone, the IDUA cDNA 
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sequence was inserted into the EcoRI site of pBS plasmid 
(Moskowitz et al . , 1992, FASEB J. 6:A77; Murray, 1987, 
Methods in Enzymol. 149:25-42). This IDUA cDNA sequence has 
been expressed in animal cell lines (Moskowitz et al . , 1992, 
5 supra, 1987, supra) and shown to contained all the 

information necessary to produce enzymatically active IDUA 
(Murray, 1987, supra). The IDUA cDNA encodes a 653 amino 
acid protein (66 kDa) including the 26 amino-terminal signal 
peptide which is cleaved as it passes through the ER 

10 membrane. To aid in the insertion of the IDUA cDNA into the 
plant vector, unique flanking Xbal and Sacl sites were 
introduced by PCR using 5 '-primer ID1 and 3 '-primer ID2, Pfu 
polymerase (Stratagene, La Jolla, CA) ; as shown in Figure 12. 
The 1.9 kb fragment generated by PCR was purified by agarose 

15 gel electrophoresis, digested with Xbal and Sacl, and ligated 
into pBS and pSP64polyA (Gibco, a vector for in vitro 
transcription/translation) . The PCR-amplif ied IDUA coding 
sequence was sequenced prior to insertion into the expression 
constructs- The nucleotide and deduced amino acid sequences 

2 0 of the amplified IDUA coding sequence are shown in FIGS. 19 

(SEQ ID NO: 8) and 2 0 (SEQ ID NO: 9), respectively. The PCR- 
amplified IDUA coding sequence differ from that originally 
published by E. Neufeld at positions 931 and 932. The PCR- 
amplif ied IDUA sequence has the dinucleotide CG instead of 

25 the original GC at those positions. Accordingly, the deduced 
amino acid sequence of the PCR-amplif ied IDUA has a 
glutamate, instead of a glutamine, residue at position 282. 
In vitro transcription of the PCR-amplif ied IDUA sequence in 
a pSP64polyA:IDUA vector and rabbit reticulocyte 

30 lysate-mediated in vitro translation of the resultant 

transcript produced protein having a molecular size expected 
for IDUA. 

The PCR-amplif ied IDUA coding region was inserted 
downstream of two distinctly regulated plant promoters: 1) 

3 5 the MeGA promoter and 2) the 358*™ promoter. As discussed 

above, the MeGA promoter shows little or no expression in 
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most plant tissues but is strongly inducible resulting in 
significant transgene product accumulation 12 to 48 hours 
after induction of the MeGA promoter. The ass 1 *™ promoter is 
a widely used high-level constitutive promoter consisting of 
5 a modified CaMV 3 5S promoter containing double enhancer which 
is fused to a translational enhancer from the tobacco etch 
virus. See Cramer et al . , 1996, "High-Level of Enzymatically 
Active Human Lysosomal Proteins in Transgenic Tobacco", 
Transgenic Plants; A production System for Industrial and 
10 Pharmaceutical Proteins , eds. , Owens & Pen, John Wiley & 

Sons; Chrispeels, 1991, Annu. Rev. Plant Physiol. Plan. Biol. 
42:21-53; and Haskins et al., 1979, Pediat. Res. 13:1294- 
1297. Each promoter was ligated as a Hindlll-Xbal fragment 
upstream of the IDUA cDNA (see Figure 12) . 

15 

7.1.2. IDUA EXPRESSION/TRANSFORMATION VECTORS 

During the subcloning and vector analysis steps, 
bacterial transf ormants having any vector containing the 
5' -end of the IDUA cDNA were recovered at lower than expected 

20 frequencies. For example, multiple ligation and 

transformations of competent E • coll cells DH5a with pBS 
containing the 1.9 kb PCR amplified IDUA cDNA were required 
to generate fewer than 100 transf ormants . Among the 70 
transformants analyzed by restriction analysis of the plasmid 

25 DNA, only 2 clones contained the proper sized 1.9 kb 

fragment. One of the two clones was sequenced and found to 
contain the complete IDUA coding sequence. Colony size of 
IDUA containing transf ormant was reduced. These reduced 
efficiencies were independent of plasmid vector, presence or 

30 absence of plant promoter, IDUA expression (not fused to a 
bacterially active promoter) or bacterial host- Independent 
subcloning of the 3'- versus 5 ' -end of the IDUA cDNA 
localized an "obnoxious" region to the 5' -end of the IDUA 
sequence. DNA secondary structure or the high GC content of 

35 this region may cause intolerance in heterologous organisms. 
This effect by the 5 '-end of the IDUA cDNA has also been 
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noticed in yeast and animal cell expression systems. These 
limitations in transformation of the IDUA sequence, however, 
did not preclude successful isolation and characterization of 
the desired IDUA expression and transformation constructs. 
5 For both promoter constructs, the promoter : IDUA cDNA 

fusions were excised as Hindlll/SacI fragments and ligated 
into Hindlll and Sad-digested pBIB-KAN (Figure 12) . 
pBlB-Kan is a large (>13 kb) plant transformation vector that 
provides a terminator/polyadenylation signal (pAnos) for the 

10 introduced transgene, a selectable marker (NPTII or kanamycin 
resistance) for transformed plant cells, and T-DNA border 
sequences that demarcate the DNA to be transferred (Becker, 
1990, Nucl. Acids Res. 18:203). The recombinant vectors were 
propagated in E. coli and fully characterized prior to 

X5 transfer to Agrobacterium tumefaciens . A pBIB-KAN vector 
containing the MeGA : IDUA expression construct used in T-DNA 
transformation of plants is pCT22. 

7.2. GENERATION OF TRANSGENIC TOBACCO CONTAINING THE 
20 IDUA CONSTRUCTS 

AgroJbacterium-mediated transformation was used to stably 

integrate the 35S ENH :IDUA and MeGA: IDUA constructs into the 

genome of tobacco. Approximately 80 leaf discs were excised 

from aseptically grown Nicotiana tabacum cvs . Xanthi 

25 seedlings for each gene construct and inoculated with 

suspension cultures of A . tumefaciens strains containing the 
IDUA expression/transformation vectors. Following a 48 hour 
co-cultivation period, the leaf discs were transferred to 
selection media containing kanamycin and hormones that 

30 promote shoot formation. Although numerous shoots (4-10 per 
disc) generally appear 2-3 weeks after transfer to selection 
media, the IDUA-transf ormed shoots appeared late, i.e.,- after 
3-5 weeks, and were few in number (0-1 per disc) . Induction 
of root formation was also delayed in the IDUA-transf ormed 

35 shoots compared to shoots containing other transgene 
constructs. A final yield of seven 35S EKH :IDUA and ten 
MeGA: IDUA plantlets were transferred to soil. Once in soil, 
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all plants grew to maturity with normal morphology, 
flowering, and seed production. IDUA-expressing progenies 
showed slight retardation in early growth (FIG. 13B) but were 
indistinguishable in size and appearance from untransf ormed 
5 plants at full maturity. 

7.3. SOUTHERN CHARACTERIZATION OP TRANSGENIC PLANTS 

Transgenic plants were initially selected based on 
kanamycin resistance. The stable insertion of the MeGA: IDUA 

10 gene construct was confirmed by genomic Southern 

hybridization analysis. Total DNA was isolated from leaf 
tissue of nine transgenic plants and digested with Hindlll, 
and analyzed by Southern hybridization using the IDUA cDNA as 
probe. The nine putative transf ormants analyzed showed one 

15 to three copies of the IDUA insert and no indication of 

rearrangements or deletions- This transgene copy number is 
typical of transgenic tobacco engineered with other 
constructs via Agr*oJbacteriuro. 

20 7.4. CHARACTERIZATION OF IDUA EXPRESSION IN 
TRANSGENIC PLANTS 

7.4.1. IMMUNO-DETECTION OF IDUA PROTEIN IN 
PLANT EXTRACT 

Antibodies made to the native and denatured IDUA from 
25 CHO cells were obtained from E. Kakkis (Harbor-UCLA Medical 
Center, Los Angeles, CA) . By immuno-slot blot and SDS-PAGE 
Western analysis, the antibodies were found not to react with 
any proteins in untransf ormed or pBIB-Kan (transformed vector 
alone) transgenic tobacco tissue extracts from uninduced or 
30 induced leaf tissue. When purified IDUA from CHO cells was 
seeded to untransf ormed tobacco extracts, there was no 
diminution in the level of IDUA detected as compared to that 
detected in extraction buffer containing the same 
concentration of purified IDUA. This finding indicates that 
3 5 tobacco extract does not inhibit immuno-detection of IDUA. 
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Leaf tissues from seven independent transgenic plants 
were harvested, homogenized in 3X volume of extraction buffer 
(PBS with 0.1% Triton X100, 200 jiM PMSF, 1 fiM pepstatin, 4 fiM 
leupeptin) and the extracts cleared of cell debris by 
5 centrifugation at 12,000 X g for 30 min. Twenty-five /xg of 
total soluble protein from each extract was heat-denatured 
and slotted onto OPTITRAN membrane (S&S) . Purified IDUA 
protein in amounts ranging from 20 ng to 400 ng were added to 
the membrane to serve as comparison standards. Based on 

10 antibody detection using chemi luminescence , no immuno- 

reactive IDUA protein was found in the extracts of any of the 
35S ENH :IDUA transgenic plants. This constitutive promoter 
also poorly expressed human protein C (<0.02% of soluble 
protein). Based on these findings, the 35S ENH : IDUA-containing 

15 plants were not analyzed further. 

The MeGA promoter is inactive in tobacco leaves in the 
absence of induction. To obtain IDUA expression, leaves were 
harvested, induced by mechanical wounding and incubated at 
room temperature under high humidity (i.e., the wounded 

20 leaves are wrapped in moist filter paper in sealed bags or 
layered in a container with buffer gently swirled over the 
tissue) to allow de novo synthesis of the transgene product. 
In an initial screen of ten MeGA: IDUA-containing plants, 
tissue extracts were used for immunodot-blot analyses (see 

2 5 above) . The extracts showed little or no IDUA content for 
all plants. Later analyses revealed that IDUA was secreted 
from the leaves and leached out onto the filter paper during 
the incubation step. This was somewhat surprising because 
recovery of extracellular proteins from intact leaf generally 

30 requires vacuum- induced buffer infiltration of the leaf (see 
Parent & Asselin, 1987, Can. J. Bot. 62:564-569; Regalado & 
Ricardo, 1996, Plant Physiol. 110:227-232). As described 
below, the expression procedure was subsequently modified to 
include a post-induction incubation step that involved gentle 

35 rotation of buffer over the wounded tissue, which permitted 
recovery of IDUA protein and activity in the incubation 



-52- 



WO 97/10353 



PCT/US96/14730 



buffer. Subsequent analyses were focused primarily on one 
plant, IDUA-9 also known as CT40-9, since preliminary tests 
show detectable levels of IDUA activity and anti-IDUA 
immuno-reactive material. IDUA-9 contains 3 copies of the 
5 MeGA: IDUA construct. 

7.4.2. NORTHERN ANALYSIS SHOWS ACTIVATION OF THE 
MEGA; IDUA TRANSGENE 

In order to demonstrate induction of the MeGA promoter 

„ and accumulation of IDUA mRNA, total RNA was isolated 
10 

(Rutter, 1981, J. Biol. Chem. 91:468-478) from IDUA-9 leaves 
before and after induction. As shown in Figure 14A, IDUA 
mRNA of the expected size (approximately 2.2 kb) was detected 
at low basal levels in uninduced tissue and showed a marked 
5 increase at 8 hrs post-induction and reached a maximum level 
at 27 hrs post-induction. This pattern is similar to 
transgene induction kinetics seen with other MeGA-driven 
constructs (e.g., hGC : FLAG 1 * ) . The smaller hybridizing RNA 
species also accumulated after induction. Analogous lower 
20 molecular weight RNAs have not been detected in hGC : FLAG™ 

expressing plants and may be unique to the IDUA-9 plant or a 
consequence of the IDUA sequence. 

7.4.3. WESTERN ANALYSIS OF HUMAN IDUA LOCALIZED 
TO TOBACCO 

25 

The induced IDUA-9 tissues were also used for protein 
extracts. Western blot analysis showed CHO-derived IDUA and 
IDUA from tobacco tissue migrated very similarly in SDS-PAGE 
(Figure 14B) . The IDUA (92 kD) from IDUA-9 tobacco extract 
30 migrated slightly faster than secreted IDUA from CHO cells. 
This presumably is due to differences in glycan composition. 
However, the similarity in size suggests that the tobacco 
produced recombinant IDUA was also glycosylated. 
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7.4.4* IDOA SYNTHESIZED IN TRANSGENIC TOBACCO IS 
SECRETED 

As discussed above, CHO cells secret recombinant IDUA 

into the media. To determine if tobacco also secret 

5 recombinant IDUA into the media, leaf tissue from transgenic 

IDUA-7, -8 and -9 plants were induced for 0 to 34 hrs and 

placed in a plastic petri dish with incubation buffer (PBS) . 

At 0 hr, incubation buffer was used to wash the induced 

tissue and the wash stored frozen. Fresh buffer was added to 

1Q the induced tissue and incubated at room temperature. At 8 
hrs, the buffer was removed and frozen. Fresh buffer was 
added to the induced tissue and incubated further. The 
buffer was removed at 24 hrs post-induction. Fresh buffer 
was added to the induced tissue and further incubated. The 

15 final incubation buffer was removed 34 hrs post-induction and 
a tissue extract was prepared from the incubated leaf tissue. 
Fifty Ml of each incubation buffer and tissue extract was 
boiled and slotted onto OPTITRAN membrane. A range of 
control IDUA protein from 0 to 40 ng was also blotted and 

20 IDUA was detected using anti-IDUA antibodies. As shown in 
Figure 15, IDUA protein was present in the incubation buffer 
following induction in all three transgenic tissue analyzed. 
This indicates that transgenic tobacco secret IDUA after 
synthesis. 



25 



30 



35 



7.4.5. THE TOBACCO-SYNTHESIZED IDUA IS 
BN2 YMATI C ALL Y ACTIVE 

One of the most critical factor in assessing the utility- 

of plant-synthesized recombinant IDUA is whether the IDUA is 

enzyraatically active. Enzyme activity of human lysosomal 

hydrolases requires appropriate glycosylation and folding and 

heterologous expression systems often result in endoplasmic 

reticulum-localized degradation or accumulation of insoluble 

and inactive aggregates. To determine whether the 

recombinant IDUA synthesized in transgenic leaves has 

enzymatic activity, a sensitive fluorometric assay using the 

substrate, 4-Methylumbellif eryl-a-L-iduronide (4-MUI) 
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(Calbiochem, LaJolla, CA) was used (see Neufeld, E.F., 1991, 
Ann. Rev. Biochem. 60:257-280). Untransf ormed tobacco 
extracts were shown to contain no endogenous IDUA activity. 
When CHO-derived recombinant IDUA was seeded into crude 
5 extracts of untransf ormed tobacco leaves, no detectable 

inhibition of activity was found. When the tissue extracts 
from IDUA-9 transgenic plant were assayed, the extracts 
showed IDUA activity at reproducible but at relatively low 
levels (0.2 to 0.4 nmole 4-MU/hr/gm tissue). This confirms 
10 that tobacco has all the necessary machinery to synthesize 
and process IDUA into an active form. Consistent with IDUA 
distribution shown by imrouno-detection, significantly higher 
IDUA activities were detected in the secreted fraction as 
described below. 

15 

7.4.6. SECRETION AND RECOVERY OF TOBACCO- 
SYNTHESIZED RECOMBINANT IDUA 

Significant portion of the recombinant IDUA produced in 

transgenic tobacco was recovered in the incubation buffer 

2Q following induction of the MeGA : IDUA gene construct (FIG. 
15) . Localization of the majority of active IDUA after 
induction and incubation was determined. This was done by 
comparing the IDUA activity and anti-IDUA immuno-reactivity 
of tissue extract with those of the incubation buffer. As 

25 shown in Figure 16, there was much higher levels of IDUA 

activity in the incubation buffer than in the tissue extract 
after induction and incubation. Moreover, the IDUA activity 
in the incubation buffer showed strong correlation with the 
the amount of anti-IDUA immuno -reactive material found in the 

30 incubation buffer, as reveal by the data presented in FIG. 
15. Thus, IDUA-expressing transgenic tobacco secret most of 
its active IDUA (about 67%) into the incubation buffer after 
induction and incubation. 

Based on activity assays and Western analysis, the 

35 specific activity of secreted IDUA was estimated to be about 
64 U/Mg protein. In comparison, purified IDUA enzyme from 
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engineered CHO cells has a specific activity of about 242 
protein. 

Variation in transgene expression levels is very common 
in transgenic plants due to "positional" effects caused by 
5 the site of transgene insertion within the host genome. The 
IDUA activity levels in three independent IDUA-expressing 
transgenic plants (i.e., IDUA-7 , IDUA-8 and IDUA-9 ) were 
examined. Among these transgenic plants, IDUA-9 has the 
highest IDUA activity (FIG. 17) . The relative amount of 
10 active IDUA remaining in the cell, as reflected by the 
activity present in tissue extract, after 34 hrs of 
incubation ranged from 14% to 3 5% of the total activity ( FIG - 
17) . 

The above-identified three transgenic plants were 
15 identified in a screen of about fifty independently 

transformed plants. This is a relatively small scale screen. 
It is reasonable to expect that larger scale screenings of 
IDUA-engineered plants will yield plants that produce active 
IDUA at levels higher than those of the plants disclosed 
2 0 herein. 

7.4.7. PURIFICATION AND YIELD OF IDUA FROM 
TRANSGENIC TOBACCO 

The yield of recombinant IDUA from IDUA-9 was estimated 

25 to be about 6 nq/qm fresh tissue. This estimate was based on 

the material present in the incubation buffer after 34 hrs of 

incubation (see FIG. 18) . However, neither the induction nor 

the IDUA recovery procedure used was optimized. Thus, it is 

likely that higher IDUA yields may be acheived through 

30 optimization of induction and recovery procedures. It should 

be noted that the transgenic tobacco plants yielded an 

average of greater than 1 kg fresh weight of leaf at 

maturity, and that leaves can be periodically harvested from 

greenhouse-grown plants for over an year. Accordingly, 

35 cultivation of transgenic tobacco plants either in the field 

of the greenhouse offers a convenient and effective means for 

producing large amounts of IDUA. 
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8. EXAMPLE 3: PRODUCTION OF TRANSGENIC TOBACCO PLANTS 
CONTAINING AN UNMODIFIED hGC EXPRESSION CONSTRUCT 

A 3' end segment of the hGC coding sequence was PCR- 

amplified from the cDNA clone in E. coli ATCC65696 (see 

Section 6.1.1., supra) using as the 5' primer GC23 oligo, 

5 ' GCCTATGCTGAGCACAAGTTACAG3 9 (SEQ ID NO: 11), whose 5' end 

corresponds to nucleotide 894 of the hGC: FLAG sequence shown 

in FIG. 9, and as the 3' primer GC37 oligo , whose 

complementary strand has the sequence 

5 ' TTCCTTGAGCTCGtcaCTGGCGACGCCACAGGTA3 ' (SEQ ID NO: 12), a Sad 
restriction site is shown with an underline and a stop codon 
that is in-frame to the amplified hGC coding sequence is 
shown in lower case. The site of the 5' primer in the hGC 
coding sequence is 5' upstream of a Sail restriction site. 
Accordingly, the amplified DNA was cut with Sail and Sad, 
and the Sall/SacI fragment containing the 3' end of hGC 
coding sequence was inserted into the pBS intermediate vector 
containing the MeGA : hGC : FLAG™ expression construct (see FIG. 
1 and Section 6.1.2., supra) which had been cut with Sail and 
Sacl. Clones were identified that had replaced the 3' end of 
the MeGA: hGC: FLAG™ construct with the 3 ' end of hGC coding 
sequence yielding a MeGA: hGC expression construct. This 
construction eliminated the ten amino acid addition at the 
carboxyl terminal and corrected the amino acid substitution 
at residue 545 in the hGC : FLAG™ fusion, and thereby 
reconstructing an unmodified hGC coding sequence. The 
MeGA: hGC expression construct was excised from the pBS 
intermediate vector by Sacl digestion and inserted into pBIB- 
KAN to form the transformation vector pCT54 . A schematic of 
the construction of the pCT54 vector is shown in FIG. 21. 

Agrobacterlum containing pCT54 was used to tranformed 
plants and transgenic tobacco plants containing the MeGA: hGC 
expression construct were produced according to procedures 
described aboveTransgenic tobacco plants containing the 
MeGA: hGC expression construct were identified and assigned 
the designations CT54-1 to -40. Analyses of hGC enzymatic 
activity and presence of hGC in the induced tissues of 
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transgenic plants are carried out using the enzymatic assay 
described in Section 6.2.5. and the Western blot analysis 
using anti-hGC antibodies described in Section 6.2.6. 
Purification of the hGC produced in transgenic tobacco tissue 
5 is carried out using the procedure described in Section 6.3., 
except the anti-FLAG™ affinity chromatography step was 
omitted, which procedure is further modified accordingly to 
strategies and methods known in the art for purifying the hGC 
enzyme. 

10 

9. DEPOSIT OF BIOLOGICAL MATERIALS 

The following biological materials have been deposited 
with the American Type Culture Collection (ATCC) at 12301 
Parklawn Drive, Rockville, MD. 20852, in compliance with the 
15 requirements of the Budapest Treaty On The International 

Recognition Of The Deposit Of Microorganisms For The Purpose 
Of Patent Procedure, on the dates and were assigned the ATCC 
accession numbers indicated below. 



20 Deposited Material 

DNA of pCTProl: hGC: FLAG 

seeds of tobacco plant 
hGC X-ll 



seeds of tobacco plant 
hGC X-27 



DNA of pCT22 

seeds of tobacco plant 
CT4 0-9 



Deposit Date 
Sept. 14, 1995 
14, 1995 



Sept. 

Sept. 

Aug. 
Aug. 



, 14, 1995 

30, 1996 
30, 1996 



Accession No. 

97277 

97275 

97276 

97701 
97700 



3 0 DNA of pCT54 

The present invention is not to be limited in scope by 
the biological material deposited since the deposited 
embodiments are intended as illustrations of the individual 
aspects of the invention, and any biological material, or 

35 

constructs which are functionally equivalent are within the 
scope of this invention. Indeed, various modifications of 
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the invention in addition to those shown and described herein 
will become apparent to those skilled in the art from the 
foregoing description and accompanying drawings- Such 
modifications are intended to fall within the scope of the 
5 appended claims. 

Various references are cited herein; these are 
incorporated by reference in their entirety. 
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SEQUENCE LISTING 



PCT/US96/I4730 



(1) GENERAL INFORMATION: 



(i) APPLICANT: RADIN , DAVID N • 

CRAMER, CAROLE L. 

OISHI, KAREN K. 

WE I S SEN BORN , DEBORAH L. 

(ii) TITLE OF INVENTION: PRODUCTION OF LYSOSOMAL ENZYMES IN 
PLANT— BASED EXPRESSION SYSTEMS 



(iii) NUMBER OF SEQUENCES: 12 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 
<F) ZIP : 10036-2711 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/003,737 

(B) FILING DATE: 14-SEP-1995 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Coruzzi, Laura A. 
<B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 7956-0011-999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741 

(C) TELEX: 66141 PENNIE 



<2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TTGTCTAGAG TAAGCATCAT GGCTGGC 27 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CACGAATTCT GGCGACGCCA CAGGTAGGTG TGA 33 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
< D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



ATGGAGTTTT 


CAAGTCCTTC 


CAGAGAGGAA 


TGTCCCAAGC 


CTTTGAGTAG 


GGTAAGCATC 


60 


ATGGCTGGCA 


GCCTCACAGG 


TTTGCTTCTA 


CTTCAGGCAG 


TGTCGTGGGC 


AT CAGGTGCC 


120 


CGCCCCTGCA 


TCCCTAAAAG 


CTTCGGCTAC 


AGCTCGGTGG 


TGTGTGTCTG 


CAATGCCACA 


180 


TACTGTGACT 


CCTTTGACCC 


CCCGACCTTT 


CCTGCCCTTG 


GTACCTTCAG 


CCGCTATGAG 


240 


AGTACACGCA 


GTGGGCGACG 


GATGGGGCTG 


AGTATGGGGC 


CCATCCAGGC 


TAATCACACG 


300 


GGCACAGGCC 


TGCTACTGAC 


CCTGCAGCCA 


GAACAGAAGT 


TCCAGAAAGT 


GAAGGGATTT 


360 


GGAGGGGCCA 


TGACAGATGC 


TGCTGCTCTC 


AACATCCTTG 


CCCTGTCACC 


CCCTGCCCAA 


420 


AATTTGCTAC 


TTAAATCGTA 


CTTCTCTGAA 


GAAGGAATCG 


GATATAACAT 


CATCCGGGTA 


480 


CCCATGGCCA 


GCTGTGACTT 


CTCCATCCGC 


ACCTACACCT 


ATGCAGACAC 


CCCTGATGAT 


540 


TTCCAGTTGC 


ACAACTTCAG 


CCTCCCAGAG 


GAAGATACCA 


AGCTCAAGAT 


ACCCCTGATT 


600 


CACCGAGCCC 


TGCAGTTGGC 


CCAGCGTCCC 


GTTTCACTCC 


TTGCCAGCCC 


CTGGACATCA 


660 


CCCACTTGGC 


TCAAGACCAA 


TGG AG CGGTG 


AATGGGAAGG 


GGTCACTCAA 


GGGACAGCCC 


720 


GGAGACATCT 


ACCACCAGAC 


CTGGGCCAGA 


TACTTTGTGA 


AGTTCCTGGA 


TGCCTATGCT 


780 


GAGCACAAGT 


TACAGTTCTG 


GGCAGTGACA 


GCTGAAAATG 


AGCCTTCTGC 


TGGGCTGTTG 


840 


AGTGGATACC 


CCTTCCAGTG 


CCTGGGCTTC 


ACCCCTGAAC 


ATCAGCGAGA 


CTTCATTGCC 


900 


CGTGACCTAG 


GTCCTACCCT 


CGCCAACAGT 


ACTCACCACA 


ATGTCCGCCT 


ACTCATGCTG 


960 


GATGACCAAC 


GCTTGCTGCT 


GCCCCACTGG 


G CAAAGG TGG 


TACTGACAGA 


CCCAGAAGCA 


1020 
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G CT AAAT ATG 


TTCATGGCAT 


TGCTGTACAT 


TGGTACCTGG 


A CTT TCTGG C 


TCCAGCCAAA 


1080 


GCCACCCTAG 


GGGAGACACA 


CCG CCTGTT C 


CCCAACACCA 


TG CTCTTTG C 


CTCAGAGGCC 


1140 


TG TG TGG C3 PT 


CCARGTTCTG 










l inn 


CAGTACAGCC 


ACAGCATCAT 


CACGAACCTC 


CTGTACCATG 


TGGTCGGCTG 


GACCGACTGG 


1260 


AACCTTGCCC 


TGAACCCCGA 


AGGAGGACCC 


AATTGGGTGC 


GTAACTTTGT 


CGACAGTCCC 


1320 


ATCATTGTAG 


ACGTCACCAG 


GGACACGTTT 


TACAAACAGC 


CCATGTTCTA 


CCACCTTGGC 


1380 


CACTTCAGCA 


AGTTCATTCC 


TGAGGGCTCC 


CAGAGAGTGG 


GGCTGGTTGC 


CAGTCAGAAG 


1440 


AACG A CCTGG 


ACGCAGTGGC 


ACTG ATG CAT 


CCCGATGGCT 


CTGCTGTTGT 


GGTCGTGCTA 


1500 


AACCGCTCCT 


CTAAGGATGT 


GCCTCTTACC 


ATCAAGGATC 


CTGCTGTGGG 


CTTCCTGGAG 


1560 


ACAATCTCAC 


CTGGCTACTC 


CATTCACACC 


TACCTGTGGC 


GTCGCCAGAA 


TTCGGACTAC 


1620 


AAGGACGACG 


ATGACAAGTT 


GA 








1642 



(2) INFORMATION FOR SEQ ID NO: 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Phe Ser Ser Pro Ser Arg Glu Glu Cys Pro Lys Pro Leu Ser 
1 5 10 * 15 

Arg Val Ser lie Met Ala Gly Ser Leu Thr Gly Leu Leu Leu Leu Gin 
20 25 30 

Ala Val Ser Trp Ala Ser Gly Ala Arg Pro Cys lie Pro Lys Ser Phe 
35 40 45 

Gly Tyr Ser Ser Val Val Cys Val Cys Asn Ala Thr Tyr Cys Asp Ser 
50 55 60 

Phe Asp Pro Pro Thr Phe Pro Ala Leu Gly Thr Phe Ser Arg Tyr Glu 
65 70 75 80 

Ser Thr Arg Ser Gly Arg Arg Met Glu Leu Ser Met Gly Pro lie Gin 
85 90 95 

Ala Asn His Thr Gly Thr Gly Leu Leu Leu Thr Leu Gin Pro Glu Gin 
100 105 110 

Lys Phe Gin Lys Val Lys Gly Phe Gly Gly Ala Met Thr Asp Ala Ala 
115 120 125 

Ala Leu Asn lie Leu Ala Leu Ser Pro Pro Ala Gin Asn Leu Leu Leu 
130 135 140 

Lys Ser Tyr Phe Ser Glu Glu Gly He Gly Tyr Asn He He Arg Val 
145 150 155 160 
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Pro Met Ala Ser Cys Asp Phe Ser lie Arg Thr Tyr Thr Tyr Ala Asp 
165 170 175 

Thr Pro Asp Asp Phe Gin Leu His Asn Phe Ser Leu Pro Glu Glu Asp 
180 185 190 

Thr Lys Leu Lys lie Pro Leu lie His Arg Ala Leu Gin Leu Ala Gin 
195 200 205 

Arg Pro Val Ser Leu Leu Ala Ser Pro Trp Thr Ser Pro Thr Trp Leu 
210 215 220 

Lys Thr Asn Gly Ala Val Asn Gly Lys Gly Ser Leu Lys Gly Gin Pro 
225 230 235 240 

Gly Asp lie Tyr His Gin Thr Trp Ala Arg Tyr Phe Val Lys Phe Leu 
245 250 255 

Asp Ala Tyr Ala Glu His Lys Leu Gin Phe Trp Ala Val Thr Ala Glu 
260 265 270 

Asn Glu Pro Ser Ala Gly Leu Leu Ser Gly Tyr Pro Phe Gin Cys Leu 
275 280 285 

Gly Phe Thr Pro Glu His Gin Arg Asp Phe lie Ala Arg Asp Leu Gly 
290 295 300 

Pro Thr Leu Ala Asn Ser Thr His His Asn Val Arg Leu Leu Met Leu 
305 310 315 320 

Asp Asp Gin Arg Leu Leu Leu Pro His Trp Ala Lys Val Val Leu Thr 
325 330 335 

Asp Pro Glu Ala Ala Lys Tyr Val His Gly He Ala Val His Trp Tyr 
340 " 345 350 

Leu Asp Phe Leu Ala Pro Ala Lys Ala Thr Leu Gly Glu Thr His Arg 
355 360 365 

Leu Phe Pro Asn Thr Met Leu Phe Ala Ser Glu Ala Cys Val Gly Ser 
370 375 380 

Lys Phe Trp Glu Gin Ser Val Arg Leu Gly Ser Trp Asp Arg Gly Met 
385 390 395 400 

Gin Tyr Ser His Ser He He Thr Asn Leu Leu Tyr His Val Val Gly 
405 410 415 

Trp Thr Asp Trp Asn Leu Ala Leu Asn Pro Glu Gly Gly Pro Asn Trp 
420 425 430 

Val Arg Asn Phe Val Asp Ser Pro lie He Val Asp Val Thr Lys Asp 
435 440 445 

Thr Phe Tyr Lys Gin Pro Met Phe Tyr His Leu Gly His Phe Ser Lys 
450 455 460 

Phe He Pro Glu Gly Ser Gin Arg Val Gly Leu Val Ala Ser Gin Lys 
465 470 475 480 

Asn Asp Leu Asp Ala Val Ala Leu Met His Pro Asp Gly Ser Ala Val 
485 490 495 

Val Val Val Leu Asn Arg Ser Ser Lys Asp Val Pro Leu Thr He Lys 
500 505 510 

Asp Pro Ala Val Gly Phe Leu Glu Thr He Ser Pro Gly Tyr Ser He 
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515 520 525 

His Thr Tyr Leu Trp Arg Arg Gin Asn Ser Asp Tyr Lys Asp Asp Asp 
530 535 540 

Asp Lys 
545 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "MeGA Promoter" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CAATACGATA TTACCGAATA TT AT AC T AAA TCAAAATTTA ATTTATCATA TCAATTATTA 60 

AACTGATATT TCAAATTTTA ATATTTAATA TCTACTTTCA ACTATTATTA CCTAATTATC 120 

AAATGCAAAA TGTATGAGTT ATTTCATAAT AGCCCAGTTC GTATCCAAAT ATTTTACACT 180 

TGACCAGTCA ACTTGACTAT ATAAAACTTT ACTTCAAAAA ATTAAAAAAA AAAGAAAGTA 240 

TATTATTGTA AAAGATAATA CTCCATTCAA AATATAAAAT GAAAAAAGTC CAGCGCGGCA 300 

ACCGGGTTCC TATAAATACA TTTCCTACAT CTTCTCTTCT CCTCACATCC CAT C ACTCTT 360 

CTTTTAACAA TTATACTTGT CAATCATCAA TCCCACAAAC AACACTTTTT CTCTCCTCTT 420 

TTTCCTCACC GGCGGCAGAC TTACCGGTGA AAGTAAGCAG STC 463 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTAGTCTAGA ATGCGTCCCC TGCGCCCCCG CG 
(2) INFORMATION FOR SEQ ID NO: 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GGAATTCGAG CTCTCATGGA TTGCCCGGGG ATG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATGCGTCCCC 


TGCGCCCCCG 


CGCCGCGCTG 


CTGGCGCTCC 


TGGCCTCGCT 


CCTGGCCGCG 


60 


CCCCCGGTGG 


CCCCGGCCGA 


GGCCCCGCAC 


CTGGTGCAGG 


TGGACGCGGC 


CCGCGCGCTG 


120 


TGGCCCCTGC 


GGCGCTTCTG 


GAGGAGCACA 


GGCTTCTGCC 


CCCCGCTGCC 


ACACAGCCAG 


180 


GCTGACCAGT 


ACGTCCTCAG 


CTGOGACCAG 


CAGCTCAACC 


TCGCCTATGT 


GGGCGCCGTC 


240 


CCTCACCGCG 


GCATCAAGCA 


GGTCCGGACC 


CACTGGCTGC 


TGG AG CTTGT 


CACCACCAGG 


300 


GGGTCCACTG 


GACGGGGCCT 


GAGCTACAAC 


TTCACCCACC 


TGGACGGGTA 


CTTGGACCTT 


360 


CTCAGGGAGA 


ACCAGCTCCT 


CCCAGGGTTT 


GAGCTGATGG 


GCAGCGCCTC 


GGGCCACTTC 


420 


ACTGACTTTG 


AGGACAAGCA 


GCAGGTGTTT 


GAGTGGAAGG 


ACTTGGTCTC 


CAGCCTGGCC 


480 


AGGAGATACA 


TCGGTAGGTA 


CGGACTGGCG 


CATGTTTCCA 


AG TGG AA CTT 


CGAGACGTGG 


540 


AATGAGCCAG 


ACCACCACGA 


CTTTGACAAC 


GTCTCCATGA 


CCATGCAAGG 


CTTCCTGAAC 


600 


TACTACGATG 


CCTGCTCGGA 


GGGTCTGCGC 


GCCGCCAGCC 


CCGCCCTGCG 


GCTGGGAGGC 


660 


CCCGGCGACT 


CCTTCCACAC 


CCCACCGCGA 


TCCCCGCTGA 


GCTGGGGCCT 


CCTGCGCCAC 


720 


TGCCACGACG 


GTACCAACTT 


CTT CACTGGG 


GAGGCGGGCG 


TGCGGCTGGA 


CTACATCTCC 


780 


CTCCACAGGA 


AGGGTGCGCG 


CAGCTCCATC 


TCCATCCTGG 


AGCAGGAGAA 


GGTCGTCGCG 


840 


CACGAGATCC 


GGCAGCTCTT 


CCCCAAGTTC 


GCGGACACCC 


CCATTTACAA 


CGACGAGGCG 


900 


GACCCGCTGG 


TGGGCTGGTC 


CCTGCCACAG 


CCG TGGAGGG 


CGGACGTGAC 


CTACGCGGCC 


960 


ATGGTGGTGA 


AGGTCATCGC 


GCAGCATCAG 


AACCTGCTAC 


TGGCCAACAC 


CACCTCCGCC 


1020 


TTCCCCTACG 


CGCTCCTGAG 


CAACGACAAT 


GCCTTCCTGA 


GCTACCACCC 


GCACCCCTTC 


1080 


GCGCAGCGCA 


CGCTCACCGC 


GCGCTTCCAG 


GTCAACAACA 


CCCGCCCGCC 


GCACGTGCAG 


1140 


CTGTTGCGCA 


AGCCGGTGCT 


CACGGCCATG 


GGGCTGCTGG 


CGCTGCTGGA 


TGAGGAGCAG 


1200 


CTCTGGGCCG 


AAGTGTCGCA 


GGCCGGGACC 


GTCCTGGACA 


GCAACCACAC 


GGTGGGCGTC 


1260 


CTGGCCAGCG 


CCCACCGCCC 


CCAGGGCCCG 


GCCGACGCCT 


GGCGCGCCGC 


GGTGCTGATC 


1320 
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TACGCGAGCG 


ACGACACCCG 


CGCCCACCCC 


AACCGCAGCG 


TCGCGGTGAC 


CCTGCGGCTG 


1380 


CGCGGGGTGC 


CCCCCGGCCC 


GGGCCTGGTC 


TACGTCACGC 


GCTACCTGGA 


CAACGGGCTC 


1440 


TGCAGCCCCG 


ACGGCGAGTG 


GCGGCGCCTG 


GGCCGGCCCG 


TCTTCCCCAC 


GGCAGAGCAG 


1500 


TTCCGGCGCA 


TGCGCGCGGC 


TGAGGACCCG 


GTGGCCGCGG 


CGCCCCGCCC 


CTTACCCGCC 


1560 


GGCGGCCGCC 


TGACCCTGCG 


CCCCGCGCTG 


CGGCTGCCGT 


CGCTTTTGCT 


GGTGCACGTG 


1620 


TGTGCGCGCC 


CCGAGAAGCC 


GCCCGGGCAG 


GTCACGCGGC 


TCCGCGCCCT 


GCCCCTGACC 


1680 


CAAGGGCAGC 


TGGTTCTGGT 


CTGGTCGGAT 


GAACACGTGG 


GCTCCAAGTG 


CCTGTGGACA 


1740 


TACGAGATCC 


AGTTCTCTCA 


GGACGGTAAG 


GCGTACACCC 


CGGTCAGCAG 


GAAGCCATCG 


1800 


ACCTTCAACC 


TCTTTGTGTT 


CAGCCCAGAC 


ACAGGTGCTG 


TCTCTGGCTC 


CTACCGAGTT 


1860 


CG AG CCCTGG 


ACT ACTGGG C 


CCGACCAGGC 


CCCTTCTCGG 


ACCCTGTGCC 


GTACCTGGAG 


1920 


GTCCCTGTGC 


CAAGAGGGCC 


CCCATCCCCG 


GGCAATCCAT 


GAGCCTGTGC 


TGAGCCCCAG 


1980 


TGGGTTGCAC 


CTCCACCGGC 


AGTCAGCGAG 


CTGGGGCTGC 


ACTGTGCCCA 


TGCTGCCCTC 


2040 


CCATCACCCC 


CTTTGCAATA 


TATTTTT 








2067 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Arg Pro Leu Arg Pro Arg Ala Ala Leu Leu Ala Leu Leu Ala Ser 
1 5 10 15 

Leu Leu Ala Ala Pro Pro Val Ala Pro Ala Glu Ala Pro His Leu Val 
20 25 30 

His Val Asp Ala Ala Arg Ala Leu Trp Pro Leu Arg Arg Phe Trp Arg 
35 40 45 

Ser Thr Gly Phe Cys Pro Pro Leu Pro His Ser Gin Ala Asp Gin Tyr 
50 55 60 

Val Leu Ser Trp Asp Gin Gin Leu Asn Leu Ala Tyr Val Gly Ala Val 
65 70 75 80 

Pro His Arg Gly lie Lys Gin Val Arg Thr His Trp Leu Leu Glu Leu 
85 90 95 

Val Thr Thr Arg Gly Ser Thr Gly Arg Gly Leu Ser Tyr Asn Phe Thr 
100 105 110 

His Leu Asp Gly Thr Leu Asp Leu Leu Arg Glu Asn Gin Leu Leu Pro 
115 120 125 

Gly Phe Glu Leu Met Gly Ser Ala Ser Gly His Phe Thr Asp Phe Glu 
130 135 140 
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Asp Lys Gin Gin Val Phe Glu Trp Lys Asp Leu Val Ser Ser Leu Ala 
145 150 155 160 

Arg Arg Tyr He Gly Arg Tyr Gly Leu Ala His Val Ser Lys Trp Asn 
165 170 175 

Phe Glu Thr Trp Asn Glu Pro Asp His His Asp Phe Asp Asn Val Ser 
180 185 190 

Met Thr Met Gin Gly Phe Leu Asn Tyr Tyr Asp Ala Cys Ser Glu Gly 
195 200 205 

Leu Arg Ala Ala Ser Pro Ala Leu Arg Leu Gly Gly Pro Gly Asp Ser 
210 215 220 

Phe His Thr Pro Pro Arg Ser Pro Leu Ser Trp Gly Leu Leu Arg His 
225 230 235 240 

Cys His Asp Gly Thr Asn Phe Phe Thr Gly Glu Ala Gly Val Arg Leu 
245 250 255 

Asp Tyr lie Ser Leu His Arg Lys Gly Ala Arg Ser Ser He Ser He 
260 265 270 

Leu Glu Gin Glu Lys Val Val Ala Gin Glu He Arg Gin Leu Phe Pro 
275 ' 280 285 

Lys Phe Ala Asp Thr Pro He Tyr Asn Asp Glu Ala Asp Pro Leu Val 
290 295 300 

Gly Trp Ser Leu Pro Gin Pro Trp Arg Ala Asp Val Thr Tyr Ala Ala 
305 310 315 320 

Met Val Val Lys Val He Ala Gin His Gin Asn Leu Leu Leu Ala Asn 
325 330 335 

Thr Thr Ser Ala Phe Pro Tyr Ala Leu Leu Ser Asn Asp Asn Ala Phe 
340 345 350 

Leu Ser Tyr His Pro His Pro Phe Ala Gin Arg Thr Leu Thr Ala Arg 
355 360 365 

Phe Gin Val Asn Asn Thr Arg Pro Pro His Val Gin Leu Leu Arg Lys 
370 375 380 

Pro Val Leu Thr Ala Met Gly Leu Leu Ala Leu Leu Asp Glu Glu Gin 
385 390 395 400 

Leu Trp Ala Glu Val Ser Gin Ala Gly Thr Val Leu Asp Ser Asn His 
405 410 415 

Thr Val Gly Val Leu Ala Ser Ala His Arg Pro Gin Gly Pro Ala Asp 
420 425 430 

Ala Trp Arg Ala Ala Val Leu He Tyr Ala Ser Asp Asp Thr Arg Ala 
435 440 445 

His Pro Asn Arg Ser val Ala Val Thr Leu Arg Leu Arg Gly Val Pro 
450 455 460 

Pro Gly Pro Gly Leu Val Tyr Val Thr Arg Tyr Leu Asp Asn Gly Leu 
465 " 470 475 480 

Cys Ser Pro Asp Gly Glu Trp Arg Arg Leu Gly Arg Pro Val Phe Pro 
485 490 495 

Thr Ala Glu Gin Phe Arg Arg Met Arg Ala Ala Glu Asp Pro Val Ala 
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500 505 510 

Ala Ala Pro Arg Pro Leu Pro Ala Gly Gly Arg Leu Thr Leu Arg Pro 
515 520 525 

Ala Leu Arg Leu Pro Ser Leu Leu Leu Val His Val Cys Ala Arg Pro 
530 535 540 

Glu Lys Pro Pro Gly Gin Val Thr Arg Leu Arg Ala Leu Pro Leu Thr 
545 550 555 560 

Gin Gly Gin Leu Val Leu Val Trp Ser Asp Glu His Val Gly Ser Lys 
565 570 575 

Cys Leu Trp Thr Tyr Glu lie Gin Phe Ser Gin Asp Gly Lys Ala Tyr 
580 585 590 

Thr Pro Val Ser Arg Lys Pro Ser Thr Phe Asn Leu Phe Val Phe Ser 
595 600 605 

Pro Asp Thr Gly Ala Val Ser Gly Ser Tyr Arg Val Arg Ala Leu Asp 
610 615 620 

Tyr Trp Ala Arg Pro Gly Pro Phe Ser Asp Pro Val Pro Tyr Leu Glu 
625 630 635 640 

Val Pro Val Pro Arg Gly Pro Pro Ser Pro Gly Asn Pro 
645 650 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE : amino acid 

<C) STRANDEDNESS : Bingle 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "PCR primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCCTATGCTG AGCACAAGTT ACAG 
<2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Complementary sequence of a 
PCR primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCCCTTGAGC TCGTCACTGG CGACGCCACA GGTA 34 
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WHAT IS CLAIMED IS : 

1. A method for producing an enzymatically active 
lysosomal enzyme or modified lysosomal enzyme in a transgenic 
plant, comprising: 
5 (a) growing the transgenic plant which has a 

recombinant expression construct comprising a 
nucleotide sequence encoding the lysosomal enzyme 
or modified lysosomal enzyme and a promoter that 
regulates expression of the nucleotide sequence so 
10 that the lysosomal enzyme or modified lysosomal 

enzyme is expressed by the transgenic plant; and 
(b) recovering the lysosomal enzyme or modified 

lysosomal enzyme from an organ of the transgenic 
plant ; 

15 wherein the modified lysosomal enzyme has the amino acid 
sequence of the lysosomal enzyme with one or several amino 
acid substitutions, additions and/or deletions, and the organ 
is a leaf, stem, root, flower, fruit or seed. 



20 2. The method according to claim 1, in which the 

promoter is an inducible promoter, and which method 
additionally comprises, between steps (a) and (b) , the step 
of inducing the inducible promoter before or after the 
transgenic plant is harvested. 

25 

3. The method according to claim 2, in which the 
inducible promoter is induced by mechanical gene activation. 

4. The method according to claim 3, in which the 
30 inducible promoter comprises SEQ ID NO: 5. 



5. The method according to claim 1, in which the 
modified lysosomal enzyme comprises a detectable marker 
peptide fused to the amino or carboxyl terminal of the 
3 5 lysosomal enzyme. 
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6. The method according to claim 5, in which the 
detectable marker peptide comprises SEQ ID NO: 10. 

7. The method according to claim 1, in which the 
5 transgenic plant is a transgenic tobacco plant. 

8. The method according to any of claims 1, 4 and 7, 
in which the lysosomal enzyme or modified lysosomal enzyme is 
a human lysosomal enzyme or modified human lysosomal enzyme. 

10 

9. The method according to claim 8, in which the human 
lysosomal enzyme or modified human lysosomal enzyme is a 
glucocerebrosidase , modified glucocerebrosidase , a?-L- 
iduronidase or modified a-L- iduronidase . 

15 

10. A recombinant expression construct comprising a 
nucleotide sequence encoding a lysosomal enzyme or modified 
lysosomal enzyme and a promoter that regulates the expression 
of the nucleotide sequence in a plant cell, wherein the 

20 modified lysosomal enzyme has the amino acid sequence of the 
lysosomal enzyme with one or more amino acid substitutions, 
additions and/or deletions. 

11. The recombinant expression construct of claim 10, 
25 in which the promoter is an inducible promoter. 

12. The recombinant expression construct of claim 11, 
in which the inducible promoter is induced by mechanical gene 
activation . 

30 

13. The recombinant expression construct of claim 12, 
in which the inducible promoter comprises SEQ ID NO : 5 . 

14. The recombinant expression construct of claim 10, 

3 5 in which the modified lysosomal enzyme comprises a detectable 
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marker peptide fused to the amino or carboxyl terminal of the 
lysosomal enzyme . 

15. The recombinant expression construct of claim 14, 
5 in which the detectable marker peptide comprises SEQ ID 

NO: 10. 

16 . The recombinant expression construct of claim 10 or 
13, in which the lysosomal enzyme or modified lysosomal 

10 enzyme is a human lysosomal enzyme or modified human 
lysosomal enzyme . 

17. The recombinant expression construct of claim 16, 
in which the human lysosomal enzyme or modified human 

15 lysosomal enzyme is a glucocerebrosidase , modified 
glucocerebrosidase , a-L-iduronidase or modified a-L- 
iduronidase . 

18. A plant transformation vector comprising the 
20 recombinant expression construct of claim 16. 

19. A plant transformation vector comprising the 
recombinant expression construct of claim 17. 

25 20. A plant cell, tissue or organ which has the 

recombinant expression construct of claim 16. 

21. A plant cell, tissue or organ which has the 
recombinant expression construct of claim 17. 

30 

22. A plasmid CTProl : hGC : FLAG having the ATCC accession 
number 97 277. 

23. A plasmid pCT22 having the ATCC accession number 
35 97701. 
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24 . A plasmid pCT54 having the ATCC accession number 



5 25 . A transgenic plant or plant cell capable of 

producing an enzymatically active lysosomal enzyme or 
modified lysosomal enzyme, which transgenic plant or plant 
cell has a recombinant expression construct comprising a 
nucleotide sequence encoding a lysosomal enzyme or modified 
10 lysosomal enzyme and a promoter that regulates expression of 
the nucleotide sequence in the transgenic plant or plant 
cell, wherein the modified lysosomal enzyme has the amino 
acid sequence of the lysosomal enzyme with one or more amino 
acid substitutions, additions and/or deletions. 

15 

26. The transgenic plant or plant cell of claim 25, in 
which the promoter is an inducible promoter. 

27. The transgenic plant or plant cell of claim 26, in 
20 which the inducible promoter is induced by mechanical gene 

activation. 

28. The transgenic plant or plant cell of claim 27, in 
which the inducible promoter comprises SEQ ID NO: 5. 

25 

29. The transgenic plant or plant cell of claim 25, in 
which the modified lysosomal enzyme comprises a detectable 
marker peptide fused to the amino or carboxyl terminal of the 
lysosomal enzyme . 

30 

30. The transgenic plant or plant cell of claim 29, in 
which the detectable marker peptide comprises SEQ ID NO: 10. 

31. The transgenic plant or plant cell of claim 25, in 
35 which the transgenic plant or plant cell is a transgenic 

tobacco plant or tobacco cell . 
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32. The transgenic plant or plant cell of any of claims 
25, 28 and 31, in which the lysosomal enzyme or modified 
lysosomal enzyme is a human lysosomal enzyme or modified 
human lysosomal enzyme . 

5 

33. The transgenic plant or plant cell of claim 32, in 
which the human lysosomal enzyme or modified human lysosomal 
enzyme is a glucocerebrosidase , modified glucocerebrosidase , 
a-L-iduronidase or modified c*-L- iduronidase . 

10 

34. A leaf, stem, root, flower or seed of the 
transgenic plant of claim 32 . 

35. A leaf, stem, root, flower or seed of the 
15 transgenic plant of claim 33. 

36. A plant grown from a seed of plant line X-ll, which 
seed has the ATCC Accession No. 97275. 

20 37. A plant grown from a seed of plant line X-27, which 

seed has the ATCC Accession No. 97276. 

38. A plant grown from a seed of plant line CT40-9, 
which seed has the ATCC Accession No. 97700. 

25 

39. A lysosomal enzyme or modified lysosomal enzyme 
which is enzymatically active and is produced according to a 
process comprising: 

(a) growing a transgenic plant which has a recombinant 
3 0 expression construct comprising a nucleotide 

sequence encoding the lysosomal enzyme or modified 
lysosomal enzyme and a promoter that regulates 
expression of the nucleotide sequence so that the 
lysosomal enzyme or modified lysosomal enzyme is 
3 5 expressed by the transgenic plant; and 
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(b) recovering the lysosomal enzyme or modified 

lysosomal enzyme from an organ of the transgenic 
plant ; 

wherein the modified lysosomal enzyme has the amino acid 
5 sequence of the lysosomal enzyme with one or more amino acid 
substitutions, additions and/or deletions, and the organ is a 
leaf, stem, root, flower, fruit or seed. 

40. The lysosomal enzyme or modified lysosomal enzyme 
10 of claim 39, in which the promoter is an inducible promoter, 
and which process additionally comprises, between steps (a) 
and (b) , the step of inducing the inducible promoter before 
or after the transgenic plant is harvested. 

15 41> T he lysosomal enzyme or modified lysosomal enzyme 

of claim 40, in which the inducible promoter comprises SEQ ID 
NO: 5 . 

42. The lysosomal enzyme or modified lysosomal enzyme 
20 of claim 39, in which the modified lysosomal enzyme comprises 

a detectable marker peptide fused to the amino or carboxyl 
terminal of the lysosomal enzyme. 

43. The lysosomal enzyme or modified lysosomal enzyme 
25 of claim 42, in which the detectable marker peptide comprises 

SEQ ID NO: 10 . 

44. The lysosomal enzyme or modified lysosomal enzyme 
of claim 39, in which the transgenic plant is a transgenic 

30 tobacco plant. 

45. The lysosomal enzyme or modified lysosomal enzyme 
of any of claims 39, 41 and 44, in which the lysosomal enzyme 
or modified lysosomal enzyme is a human lysosomal enzyme or 

35 modified human lysosomal enzyme. 
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46. The lysosomal enzyme or modified lysosomal enzyme 
of claim 45, in which the human lysosomal enzyme or modified 
human lysosomal enzyme is a glucocerebrosidase or modified 
glucocerebrosidase, a-L- iduronidase or modified a-L- 
5 iduronidase . 



10 



15 



20 



25 



30 



35 
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MEFSSPSREE CPKPLSRVS IMAGSLTGLL LLQAVSWASG ARPCIPKSFG 

51 100 
YSSVVCVCNA TYCDSFDPP TFPALGTFSR YESTRSGRRM ELSMGPIQAN 

101 150 
HTGTGLLLTL QPEQKFQKV KGFGGAMTDA AALNILALSP PAQNLLLKSY 

151 200 

FSEEGIGYNI IRVPMASCD FSIRTYTYAD TPDDFQLHNF SLPEEDTKLK 
201 250 

1PLIHRALQL AQRPVSLLA SPWTSPTWLK TNGAVNGKGS LKGQPGDIYH 

251 300 
QTWARYFVKF LDAYAEHKL QFWAVTAENE PSAGLLSGYP FQCLGFTPEH 

301 350 
QRDFIARDLG PTLANSTHH NVRLLMLDDQ RLLLPHWAKV VLTDPEAAKY 

351 400 
VHGIAVHWYL DFLAPAKAT LGETHRLFPN TMLFASEACV GSKFWEQSVR 

401 450 
LGSWDRGMQY SHSI ITNLL YHWGWTDWN LALNPEGGPN WVRNFVDSPI 

451 500 

I VDVTKDTFY KQPMFYHLG HFSKFIPEGS QRVGLVASQK NDLDAVALMH 

501~ 550 

PDGSAVWVL NRSSKDVPL TIKDPAVGFL ETISPGYSIH TYLWRRQnsd 

ykddddk" 
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